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RG NUCLEIC ACroS FOR COhfl'EIWmG DISEASE RESISTANCE TO PLANTS 

5 

The present application is a continuation-in-part application ("dP") of U.S. 
Patent Application Serial No. ("USSN") 08/781.734, filed January 10, 1997. The 
10 aforementioned application is explicitly incorporated herein by reference in its entirety and 
for all purposes. 

This invention was made with Government support under Grant Nos. 92- 
37300-7547 and 95-37300-1571, awarded by the United States Department of Agriculture. 
15 The Government has certain rights in this invention. 

FIELD OF THE INVENTION 
The present invention relates generally to plant molecular biology. In 
particular, it relates to nucleic acids and methods for conferring pest resistance in plants. 
20 panicularly lettuce. 

BACKGROUND OF THE INVENTION 
Recently, several resistance genes have been cloned by several groups from 
several plants. Many of fliese genes are sequence related. The derived amino acid 
25 sequences of the most common class, RPS2, RPMl (bacterial resistances in Arabidopsis 
(Mindrinos et al Cell 78:1089-1099 (1994)); Bent et al Science 265:1856-1860 (1994); 
Grant et al. Science 269:843-846 (1995)), L6 (fungal resistance in flax; Lawrence, et al. 
The Plant Cell 7:1195-1206 (1995)), and M (virus resistance in tobacco; Whitiiam, et al. 
Cell 78:1101-1115 (1994); and U.S. Patent No. 5,571.706), all contain leucine-rich 
30 repeats (LRR) and nucleotide binding sites (NBS). 

The NBS is a common motif in several mammalian gene families encoding 
signal transduction components (e.g., Ras) and is associated with ATP/GTP-binding sites. 
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The NBS is a common motif in several mammalian gene fanulies encoding . 
signal transduction components (e.g.. Ras) and is associated with ATP/GTP-binding sites. 

LRR domains can mediate protein-protein interactions and are found in a 
variety of proteins involved in signal transduction, cell adhesion and various other 
functions. LRRs are leucine rich regions often comprising 20-30 amino acid repeats 
where leucine and other aliphatic residues occur periodically. URRs can fimction 

extracellularly or intracellularly. 

Since the onset of civilization, plant diseases have had catastrophic effects 
on crops and the well-being of the human population. Plant diseases continue to effect 
enormous human and economic costs. An increasing human population and decreasing 
amounts of arable land make all approaches to preventing and treating plant pathogen 
destruction critical. The ability to control and enhance a plant's protective responses 
against pathogens would be of enormous benefit. Tissue-specific and temporal control of 
mechanisms responsible for plant cell death would also be of great practical and economic 
value. The present invention fulfills tiiese and other needs. 

What is needed in flie art are plant disease resistance genes and means to 
create transgenic disease resistance plants, particularly in lettuce. Further, what is needed 
in the art is a means to DNA fingerprint cultivars and germplasm with respect to flieir 
disease resistance haplotypes for use in plant breeding programs. The present invention 
provides these and other advantages. 

SUMMARY OF THE INVENTION 

The present invention provides isolated nucleic acid constructs. These 
constructs comprise an RG (resistance gene) polynucleotide which encodes an RG 
polypeptide having at least 60% sequence identity to an RG polypeptide selected from the 
group consisting of: an RGl polypeptide, an RG2 polypeptide, an RG3 polypeptide, and 
an RG4 polypeptide. RGl. RG2. RG3, RG4. and the like, represent individual «RG 
families." Each "RG family." as defined herein, is a group of polypeptide sequences tiiat 
have at least 60% amino acid sequence identity. Individual members of an RG family. 
i.e. . individual species of the genus. typicaUy map to the same genomic locus. The 
invention provides for constructs comprising nucleotides encoding Uie RG families of Uie 
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iavenrion, which can include sequences encoding a leucine rich region (LRR), and/or a 
nucleotide binding site (NBS), or both. 

The invention provides for an isolated nucleic acid construct conq)rising an 
RG polynucleotide which encodes an RG polypeptide having at least 60% sequence 
5 identity to an RG polypeptide from an RG family selected from the group consisting of: an 
RGl polypeptide, an RG2 polypeptide, an RG3 polypeptide, an RG4 polypeptide, an RG5 
polypeptide, and an RG7 polypeptide. In alternative embodiments, the nucleic acid 
construct comprises an RG polynucleotide which encodes an RG polypeptide comprising 
an leucine rich region (LRR), or, an RG polypeptide comprising a nucleotide binding site 

10 (NBS). The nucleic acid construct can comprise a polynucleotide which is a full length 
gene. In anotiier embodiment, the nucleic acid construct encodes a fusion protein. 

In one embodiment, the nucleic acid construct comprises a sequence 
encoding an RGl polypeptide. The RGl polypeptide can be encoded by a polynucleotide 
sequence selected from the group consisting of SEQ ID N0:1 (RGl A), SEQ ID N0:2 and 

15 SEQ ID NO: 137 (RGIB). SEQ ID NO: 3 (RGIC), SEQ ID N0:4 (RGID), SEQ ID N0:5 
(RGIE), SEQ ID N0:6 (RGIF), SEQ ID N0:7 (RGIG). SEQ ID N0:8 (RGIH), SEQ ID 
N0:9 (RGII), and SEQ ID NO: 10 (RGl J). 

In another embodiment, the nucleic acid construct comprises a sequence 
encoding an RG2 polypeptide. The RG2 polypeptide can be encoded by a polynucleotide 

20 sequence selected from the group consisting of: SEQ ID N0:21 and SEQ ID NO:27 

(RG2A); SEQ ID NO:23 and SEQ ID NO:28 (RG2B); SEQ ID NO:29 (RG2C); SEQ ID 
NO:30 (RG2D); SEQ ID N0:31 (RG2E); SEQ ID NO:32 (RG2F); SEQ ID NO:33 
(RG2G); SEQ ID NO:34 (RG2H); SEQ ID NO:35 (RG2I); SEQ ID NO:36 (RG2J); SEQ 
ID NO:37 (RG2K); SEQ ID NO:38 (RG2L); SEQ ID NO:39 (RG2M); SEQ ID NO:87 

25 (RG2A); SEQ ID NO:89 (RG2B); SEQ ID N0:91 (RG2C); SEQ ID NO:93 (RG2D) and 
SEQ ID NO:94 (RG2D); SEQ ID NO:96 (RG2E); SEQ ID NO:98 (RG2F); SEQ ID 
NO: 100 (RG2G); SEQ ID NO: 102 (RG2H); SEQ ID NO: 104 (RG2I); SEQ ID NO: 106 
(RG2J) and SEQ ID NO: 107 (RG2J); SEQ ID NO: 109 (RG2K) and (SEQ ID NO: 110 
(RG2K); SEQ ID N0:112 (RG2L); SEQ ID NO:114 (RG2M); SEQ ID NO:116 (RG2N); 

30 SEQ ID NO: 118 (RG20); SEQ ID NO: 120 (RG2P); SEQ ID NO: 122 (RG2Q); SEQ ID 
.NO:124 (RG2S); SEQ ID NO:126 (RG2T); SEQ ID NO:128 (RG2U); SEQ ID NO:130 
(RG2V); and, SEQ ID NO: 132 (RG2W). 
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In Other embodiments, the nucleic acid construct comprises a RG3 sequence • 
(SEQ ID NO:68) encoding an RG3 polypeptide (SEQ ID NO:138) (RG3). la other 
embodiments, the nucleic acid construct comprises an RG4 sequence (SEQ ID NO:69) 
encoding an RG4 polypeptide ( SEQ ID NO:139) (RG4). 
5 In other embodiments, the nucleic acid construct comprises a RG5 sequence 

( SEQ ID NO:134) encoding an RG5 polypeptide ( SEQ ID NO:135). The RG5 
polypeptide can be encoded by a polynucleotide sequence as set forth in SEQ ID NO:134. 

The invention also provides for a nucleic acid constnict which comprises an 
RG7 sequence encoding an RG7 polypeptide. The RG7 polypeptide can be encoded by a 
10 polynucleotide sequence as set forth in SEQ ID NO:136. 

In fiirther embodiments, die nucleic acid construct can further comprise a 
promoter operably linked to the RG polynucleotide. In alternative embodiments, tiie 
promoter can be a plant promoter; a disease resistance promoter; a lemice promoter; a 
constimtive promoter; an inducible promoter; or, a tissue-specific promoter. The nucleic 
15 acid construct can comprise a promoter sequence from an RG gene linked to a 
heterologous polymicleotide. 

The invention also provides for a transgenic plant comprising a recombinant 
expression cassette comprising a promoter operably linked to an RG polynucleotide. The 
expression cassette can comprise a plant promoter or a viral promoter; die plant promoter 
20 can be a heterologous promoter. In one embodiment, the transgenic plant is lettuce. In 
alternative embodiments, the transgenic plant comprises an expression cassette which 
includes an RG polynucleotide selected from the group consisting of SEQ ID N0:1 
(RGIA); SEQ ID N0:2 and SEQ ID NO:137 (RGIB); SEQ ID NO: 3 (RGIC); SEQ ID 
N0:4 (RGID); SEQ ID N0:5 (RGIE); SEQ ID NO:6 (RGIF); SEQ ID N0:7 (RGIG); 
25 SEQ ID N0:8 (RGIH); SEQ ID N0:9 (RGII) and SEQ ID NO:10 (RGIJ); SEQ ID 

N0:21 and SEQ ID NO:27 (RG2A); SEQ ID NO:23 and SEQ ID NO:28 (RG2B); SEQ ID 
NO:29 (RG2C); SEQ ID NO:30 (RG2D); SEQ ID N0:31 (RG2E); SEQ ID NO:32 
(RG2F); SEQ ID NO:33 (RG2G); SEQ ID NO:34 (RG2H); SEQ ID NO:35 (RG2I); SEQ 
ID NO:36 (RG2J); SEQ ID NO:37 (RG2K); SEQ ID NO:38 (RG2L); SEQ ID NO:39 
30 (RG2M); SEQ ID NO:87 (RG2A); SEQ ID NO:89 (RG2B); SEQ ID N0:91 (RG2C); SEQ 
ID NO:93 (RG2D) and SEQ ID NO:94 (RG2D); SEQ ID NO:96 ( RG2E); SEQ ID 
NO:98 (RG2F); SEQ ID NO:100 (RG2G); SEQ ID NO:102 (RG2H); SEQ ID N0:104 
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(RG2I); SEQ ro NO: 106 (RG2J) and SEQ ID NO: 107 (RG2J); SEQ ID NO: 109 (RG2K) . 
and (SEQ ID NO:110 (RG2K); SEQ ID N0:112 (RG2L); SEQ ID N0:114 (RG2M); SEQ 
ID NO:116 (RG2N); SEQ ID N0:118 (RG20); SEQ ID NO:120 (RG2P); SEQ ID 
NO:122 (RG2Q); SEQ ID N0:124 (RG2S); SEQ ID N0:126 (RG2T); SEQ ID N0:128 

5 (RG2U); SEQ ID NO:130 (RG2V); and, SEQ ID NO:132 (RG2W); SEQ ID NO:68 
(RG3); SEQ ID NO:69 (RG4); SEQ ID NO:134 (RG5); or SEQ ID NO:136 (RG7). 

The invention provide for a transgenic plant comprising an expression 
cassette comprising an RG polynucleotide which can encode an RGl pol3rpeptide selected 
from the group consisting of SEQ ID NO: 11 (RGIA). SEQ ID NO: 12 (RGIB), SEQ ID 

10 N0:13 (RGIC), SEQ ID N0:14 (RGID), SEQ ID N0:15 (RGIE), SEQ ID N0:16 

(RGIF), SEQ ID N0:17 (RGIG), SEQ ID N0:18 (RGIH), SEQ ID N0:19 (RGII), or 
SEQ ID NO:20 (RGIJ); or, an RG2 polypeptide selected from the group consisting of SEQ 
ID NO:22 and SEQ ID N0:41 (RG2A); SEQ ID NO:24 and SEQ ID N0:42 (RG2B); SEQ 
ID NO:43 (RG2C); SEQ ID NO:44 (RG2D); SEQ ID NO:45 (RG2E); SEQ ID N0:46 

15 (RG2F); SEQ ID NO:47 (RG2G); SEQ ID NO:48 (RG2H); SEQ ID N0:49 (RG2I); SEQ 
ID NO:50 (RG2J); SEQ ID N0:51 (RG2K); SEQ ID N0:52 (RG2L); SEQ ID NO:53 
(RG2M); SEQ ID N0:88 (RG2A); SEQ JD NO:90 (RG2B); SEQ ID NO:92 (RG2C); SEQ 
ID NO:95 (RG2D); SEQ ID NO:97 ( RG2E); SEQ ID NO:99 (RG2F); SEQ ID NO:101 
(RG2G); SEQ ID NO:103 (RG2H); SEQ ID NO:105 (RG2I); SEQ ID NO:108 (RG2J); 

20 SEQ ID N0:111 (RG2K); SEQ ID N0:113 (RG2L); SEQ ID N0:115 (RG2M); SEQ ID 
NO: 117 (RG2N); SEQ ID NO: 119 (RG20); SEQ ID NO: 121 (RG2P); SEQ ID NO: 123 
(RG2Q); SEQ ID NO: 125 (RG2S); SEQ ID NO: 127 (RG2T); SEQ ID NO: 129 (RG2U); 
SEQ ID N0:131 (RG2V); and, SEQ ID NO:133 (RG2W); an RG4 polypeptide as set forth 
by SEQ ID NO:72; an RG5 polypeptide with a sequence as set forth by SEQ ID NO: 135; 

25 or, an RG7 polypeptide. 

The invention also provides for a method of enhancing disease resistance in 
a plant, the method con^rising introducing mto the plant a recombinant expression cassette 
comprising a promoter functional in the plant and operably linked to an RG polynucleotide 
sequence. In this method, the plant can be a lettuce plant; and, the RG polynucleotide can 

30 encode an RG polypeptide selected from the group consisting of an RGl polypeptide 

selected from the group consisting of SEQ ID N0:11 (RGIA), SEQ ID N0:12 (RGIB), 
SEQ ID NO: 13 (RGIC). SEQ ID NO: 14 (RGID), SEQ ID NO: 15 (RGIE), SEQ ID 
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N0:16 (RGIF). SEQ ID N0:17 (RGIG). SEQ ID N0:18 (RGIH), SEQ ID N0:19 
(RGID, or SEQ ID NO:20 (RGIJ); or, an RG2 polypeptide selected from the group 
consisting of SEQ ID NO:22 and SEQ ID N0:41 (RG2A); SEQ ID NO:24 and SEQ ID 
NO-42 (RG2B); SEQ ID NO:43 (RG2C); SEQ ID NO:44 (RG2D); SEQ ID NO:45 
(RG2E); SEQ ID NO:46 (RG2F); SEQ ID NO:47 (RG2G); SEQ ID NO:48 (RG2H); SEQ 
ID NO:49 (RG2I); SEQ ID NO:50 (RG2D; SEQ ID N0:51 (RG2K); SEQ ID NO:52 
(RG2L); SEQ ID NO:53 (RG2M); SEQ ID NO:72; SEQ ID NO:74; SEQ ID NO:88 
(RG2A); SEQ ID NO:90 (RG2B); SEQ ID NO:92 (RG2C); SEQ ID NO:95 (RG2D); 
SEQ ID NO:97 ( RG2E); SEQ ID NO:99 (RG2F); SEQ ID NO:101 (RG2G); SEQ ID 
NO:103 (RG2H); SEQ ID NO:105 (RG2D; SEQ ID NO:108 (RG2J); SEQ ID N0:111 
(RG2K); SEQ ID N0:113 (RG2L); SEQ ID N0:115 (RG2M); SEQ TO N0:117 (RG2N); 
SEQ ID N0:119 (RG20); SEQ ID N0:121 (RG2P); SEQ ID NO:123 (RG2Q); SEQ ID 
NO:125 (RG2S); SEQ ID NO:127 (RG2T); SEQ ID NO:129 (RG2U); SEQ ID N0:131 
(RG2V); and, SEQ ID NO: 133 (RG2W). In this method, the promoter can be a plant 
disease resistance promoter, a tissue-specific promoter, a constimtive promoter, or an 

inducible promoter. 

The invention also provides for a method of detecting RG resistance genes 
in a micleic acid sample, tiie method comprising: contacting the nucleic acid sample with 
an RG polynucleotide to form a hybridization complex; and. wherein the formation of die 
hybridization complex is used to detect die RG resistance gene in die nucleic acid sample. 
In this method, die RG polynucleotide can be an RGl polynucleotide, an RG2 
polynucleotide, an RG3 polymicleotide, an RG4 polynucleotide, an RG5 polynucleotide or 
an RG7 polynucleotide. In tiiis raediod. die RG resistance gene can be amplified prior to 
flie step of contacting die nucleic acid sample widi die RG polynucleotide, and, die RG 

resistance gene can be amplified by the polymerase chain reaction. In one embodiment. 

the RG polynucleotide is labeled. 

The invention furtiier provides for an RG polypeptide having at least 60% 

sequence identity to a polypeptide selected from die group consisting of: an RGl 

polypeptide, an RG2 polypeptide, an RG3 polypeptide, an RG4 polypeptide, an RG5 

polypeptide, and an RG7 polypeptide. 
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A further understanding of the nature and advantages of the present 
invention may be realized by reference to the remaining portions of the specification, the 
figures and claims. 

All publications, patents and patent applications cited herein are hereby 
expressly incorporated by reference for all purposes. 

DETAILED DESCRIPTION OF THE INVENTION 

This invention relates to families of RG genes, particularly from Lactuca 
sativa. Nucleic acid sequences of the present invention can be used to confer resistance in 
plants to a variety of pests including viruses, fungi, nematodes, insects, and bacteria. 
Sequences from within the RG genes can be used to fingerprint cultivars or germplasm for 
the presence of desired resistance genes. Promoters of RG genes can be used to drive 
heterologous gene expression under conditions in which RG genes are expressed. Further, 
the present invention provides RG proteins and antibodies specifically reactive to RG 
proteins. Antibodies to RG proteins can be used to detect the type and amount of RG 
protein expressed in a plant sample. 

The present invention has use over a broad range of types of plants, 
including species from the genera Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, 
Medicago, Onobrychis, Trifoliwn, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, 
Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, Atropa, Capsicum, Datura, 
Hyoscyamus, Lycopersicon, Nicotiana, Solarium, Petunia, Digitalis, Majorana, 
Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, 
Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, 
Browaalia, Glycine, Pisum, Phaseolus, Lolium, Oryza, Zea, Avena, Hordeum, Secale, 
Triticum, and, Sorghum, In particularly preferred embodiments, species from the family 
Compositae and in particular the genus Lactuca are employed such as L. sativa and such 
subspecies as crispa, longifolia, and asparagina. 

The nucleic acids of the present invention can be used in marker-aided 
selection. Marker-aided selection does not require the complete sequence of the gene or 
precise knowledge of which sequence confers which specificity. Instead, partial sequences 
can be used as hybridization probes or as the basis for oligonucleotide primers to amplify 
nucleic acid, e.g., by PGR. Partial sequences can be used in other methods, such as to 
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follow the segregation of chromosome segments containing resistance genes in plants. . 
Because the RG marker is the gene itself, there can be negligible recombination between 
the marker and the resistance phenotype. Thus. RG polynacleotides of the present 
invention provide an optunal means to DNA fingerprint cultivars and wfld germplasm with 
respect to their disease resistance haplotypes. This can be used to mdicate which 
germplasm accessions and cultivars carry the same resistance genes. At present, selection 
of plants (e.g., lettoce) for resistance to some diseases is slow and difficult. But Imted 
markers allow indirect selection for such resistance genes. Moreover, RG markers also 
allow resistance genes to be identified and combined in a mamier that would not otherwise 
be possible. Numerous accessions have been identified that provide resistance to all 
isolates of downy mildew {Bremia lactucae). However, without molecular markers it is 
impossible to combine such resistances from different sources. The nucleic acid sequences 
of the invention provide for a fast and convenient means to identify and combine 
resistances from different sources. The RG markers of the invention can also be used to 
ideniily recombinants that have new combinations of resistance genes in cis on the same 
chromosome. 

In addition, RG markers may allow the identification of the Mendelian 
factors determining traits, such as field resistance to downy mildew. Once such markers 
have been identified, they will greatly increase the ease with which field resistance can be 
transferred between Imes and combined with other resistances. 

fa another application, primers to RG sequences can be also designed to 
ampUfy sequences that are conserved m multiple RG family members. This gives genetic 
information on multiple RG family members. Alternatively, one or more primers can be 
made to sequences unique to a single resistance gene gemis or a single RG specie. This 
aUows an analysis of individual famUy groups (an RG genus) or an individual family 
member (a specie). Primers made to individual RGs at the edge of each cluster can be 
used to select for recombmants within the cluster. This minimizes the amount of luikage 
drag during mtrogression. Classical and molecular genetics has shown that pest resistance 
genes tend to be clustered m the genome. Pest resistance loci comprise arrays of genes 
and exhibit a variety of complex haplotypes rather than being simple alternate allelic 
forms. Pest resistance is conferred by families, or genuses. of related RG sequences, 
indi^^dual members, or species, of which have evolved to have a differem specificity. 
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Oligonucleotide primers can be designed that amplify members from multiple haplotypes, . 
or genuses, or amplify only members of one genus, or only amplify an individual specie. 
This will provide codominant information and allow heterozygotes to be distinguished 
from homozygotes. 

5 Further, comparison of RG sequences will allow a determination of which 

sequences are critical for resistance and will ultimately lead to engineering resistance genes 
with new specificities. Resistance gene sequences were not previously available for 
lettuce. Marker-aided selection will greatly increase the precision and speed of breeding 
for disease resistance. Transgenic approaches will allow pyramiding of resistance genes 

10 into a single Mendelian unit, transfer between sexually-incompatible species, substitute for 
conventional backcrossing procedures, and allow expression of other genes in parallel with 
resistance genes. 

The RG polynucleotides also have utility in the construction of disease 
resistant transgenic plants. This avoids lengthy and sometimes difficult backcrossing 

IS programs currently necessary for introgression of resistance. It is also possible to transfer 
resistance polynucleotides between sexually-incompatible species, thereby greatly 
increasing the germplasm pool that can be used as a source of resistance genes. Cloning of 
multiple RG sequences in a single cassette will allow pyramiding of genes for resistance 
against multiple isolates of a single pathogen such as downy mildew or against multiple 

20 pathogens. Once introduced, such a cassette can be manipulated by classical breeding 
methods as a single Mendelian unit. 

Transgenic plants of the present mvention can also be constructed using an 
RG promoter. The promoter sequences from RG sequences of the invention can be used 
with RG genes or heterologous genes. Thus, RG promoters can be used to express a 

25 variety of genes in the same tenq)oral and spatial patterns and at similar levels to resistance 
genes. 

Nycleiic acids of the Inventiop and Their Preparation 

RG Polynucleotide Families 
30 The present invention provides isolated nucleic acid constructs which 

comprise an RG polynucleotide. In alternative embodiments, the RG polynucleotide is at 
least 18 nucleotides in length, typically at least 20, 25, or 30 nucleotides in length, more 
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typically at least 100 nucleotides in length, generally at least 200 nucleotides in length. . 
preferably at least 300 nucleotides in length, more preferably at least 400 nucleotides in 
length, and most preferably at least 500 nucleoUdes in length. 

In particularly preferred embodiments, the RG polynucleotide encodes a RG 
protein which confers resistance to plant pests. This RG protein can be longer, equivalent, 
or shorter than the RG protein encoded by an RG gene. In various embodiments, an RG 
polynucleotide can hybridize under stringent conditions to members of an RG family (an 
RG genus); e.g., it can hybridize to a member of the RGl RG family, such as an RGl 
polynucleotide selected from the group consisting of: SEQ ID N0:1 (RGIA); SEQ ID 
NO:2 and SEQ ID NO:137 (RGIB); SEQ ID NO: 3 (RGIC); SEQ ID N0:4 (RGID); SEQ 
ID N0:5 (RGIE); SEQ ID N0:6 (RGIF); SEQ ID N0:7 (RGIG); SEQ ID N0:8 (RGIH); 
SEQ ID N0:9 (RGII) and SEQ ID NO: 10 (RGl J). 

In other embodiments, the polynucleotide can also hybridize under stringent 
conditions to a member of the RGl family; such as an RG2 polynucleotide selected from 
the group consisting of: SEQ ID N0:21 and SEQ ID NO:27 (RG2A); SEQ ID NO:23 and 
SEQ ID NO:28 (RG2B); SEQ ID N0:29 (RG2C); SEQ ID NO:30 (RG2D); SEQ ID 
NO:31 (RG2E); SEQ ID NO:32 (RG2F); SEQ ID NO:33 (RG2G); SEQ ID NO:34 
(RG2H); SEQ ID NO:35 (RG2D; SEQ ID NO:36 (RG2J); SEQ ID NO:37 (RG2K); SEQ 
ID NO:38 (RG2L); SEQ ID NO:39 (RG2M); SEQ ID NO:87 (RG2A); SEQ ID NO:89 
(RG2B); SEQ ID N0:91 (RG2C); SEQ ID NO:93 (RG2D) and SEQ ID NO:94 (RG2D); 
SEQ ID NO:96 (RG2E); SEQ ID NO:98 (RG2F); SEQ ID NO:100 (RG2G); SEQ ID 
NO:102 (RG2H); SEQ ID NO:104 (RG2D; SEQ ID NO:106 (RG2J) and SEQ ID NO:107 
(RG2J); SEQ ID NO: 109 (RG2K) and (SEQ ID NOrllO (RG2K); SEQ ID N0:112 
(RG2L); SEQ ID N0:114 (RG2M); SEQ ID N0:116 (RG2N); SEQ ID N0:118 (RG20); 
SEQ ID NO:120 (RG2P); SEQ ID N0:122 (RG2Q); SEQ ID N0:124 (RG2S); SEQ ID 
NO:126 (RG2T); SEQ ID N0:128 (RG2U); SEQ ID NO:130 (RG2V); and, SEQ ID 

NO:132(RG2W). 

In alternative embodiments, each RG2 gene can also include an AC15 
sequence which hybridizes under stringent conditions to a polynucleotide selected firom the 
group consisting of: SEQ ID NO:56 (AC15-2A); SEQ ID NO:57 (AC15-2B); SEQ ID 
NO:58 (AC15-2C); SEQ ID NO:59 (AC15-2D); SEQ ID NO:60 (AC15-2E); SEQ ID 
N0:61 (AC15-2G); SEQ ID NO:62 (AC15-2H); SEQ ID NO:63 (AC15-2I); SEQ ID 
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NO:64 (AC15-2J); SEQ ID NO:65 (AC15-2L); SEQ ID NO:66 (AC15-2N); SEQ ID 
NO:67 (AC15-20). 

In other embodiments, an RG polynucleotide can hybridize under stringent 
conditions to an RG3 (SEQ ID NO:68), an RG4 (SEQ ID NO:69), and RG5 (SEQ ID 
5 NO: 135), and an RG7 (SEQ ID NO: 137), RG famUy member. 

The present invention further provides nucleic acid constructs which 
comprise an RG polynucleotide which encodes RG polypeptides from various RG families; 
such as an RG polypeptide having at least 60% sequence identity to an RG polypeptide 
selected from the group consisting of: an RGl polypeptide, an RG2 polypeptide, an RG3 

10 polypeptide, and RG4 polypeptide, and RG5 polypeptide, and an RG7 polypeptide. 

Exemplary RGl polypeptides have the sequences shown in SEQ ID N0:2 
(RGIA), SEQ ID N0:4 (RGIB), SEQ ID N0:6 (RGIC), SEQ ID NO:8 (RGID), SEQ ID 
NO:10 (RGIE), SEQ ID N0:12 (RGIF), SEQ ID N0:14 (RGIG), SEQ ID NO:16 
(RGIH), SEQ ID NO:20 (RGIJ). Exemplary RG2 polypeptides have the sequences shown 

15 in SEQ ID NO:22 and SEQ ID N0:41 (RG2A); SEQ ID NO:24 and SEQ ID NO:42 

(RG2B); SEQ ID NO:43 (RG2C); SEQ ID NO:44 (RG2D); SEQ ID NO:45 (RG2E); SEQ 
ID NO:46 (RG2F); SEQ ID NO:47 (RG2G); SEQ ID NO:48 (RG2H); SEQ ID NO:49 
(RG2D; SEQ ID NO:50 (RG2J); SEQ ID N0:51 (RG2K); SEQ ID NO:52 (RG2L); SEQ 
ID NO:53 (RG2M); SEQ ID NO:88 (RG2A); SEQ ID NO:90 (RG2B); SEQ ID NO:92 

20 (RG2C); SEQ ID NO:95 (RG2D); SEQ ID NO:97 (RG2E); SEQ ID NO:99 (RG2F); SEQ 
ID KO:101 (RG2G); SEQ ID NO:103 (RG2H); SEQ ID NO:105 (RG2I); SEQ ID NO:108 
(RG2J); SEQ ID N0:1 1 1 (RG2K); SEQ ID N0:1 13 (RG2L); SEQ ID N0:1 15 (RG2M); 
SEQ ID N0:1 17 (RG2N); SEQ ID N0:1 19 (RG20); SEQ ID N0:121 (RG2P); SEQ ID 
NO: 123 (RG2Q); SEQ ID N0:125 (RG2S); SEQ ID N0:127 (RG2T); SEQ ID N0:129 

25 (RG2U); SEQ ID NO:131 (RG2V); and, SEQ ID N0:133 (RG2W). 

An exemplary RG3 polypeptide has tiie sequence shown in SEQ ID 
NO: 138. An exemplary RG4 polypeptide has the sequence shown in SEQ ID NO: 139. 
RG polynucleotides will have at least 60% identity, more typically at least 65% identity, 
generally at least 70% identity, and preferably at least 75% identity, more preferably at 

30 least 80% identity, and most preferably at least 85%, 90%, or 95% identity at the deduced 
amino acid level. The regions where substantial identity is assessed can be inclusive or 
exclusive of the nucleotide binding site or the leucine rich region. 
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Vectors and Transcripttonal Control Elanents 
The invention, providing methods and reagents for nuiking novel species 
and genuses of RG nucleic acids described herein, fiirther provides methods and reagents 
for expressing these nucleic acids using novel expression cassettes, vectors, transgenic 
plants and animals, using constitutive and inducible transcriptional and translational cis- 
{e.g.. promoters and enhancers) and tran^-acting control elements. 

The expression of natural, recombinant or synthetic plant disease resistance 
polypeptide-encodmg or other {i.e., antisense. ribozyme) nucleic acids can be achieved by 
operably linking the codmg region a promoter (that can be plant-specific or not. 
constimtive or inducible), incorporating the construct into an expression cassette (such as 
an expression vector), and introducing the resultam construct into an in vitro reaction 
system or a suiuble host cell or organism. Synthetic procedures may also be used. 
Typical expression systems contain, in addition to coding or antisense sequence, 
transcription and translation terminators, polyadenylation sequences, transcription and 
translation initiation sequences, and promoters usefiil for transcribing DNA into RNA. 
The expression systems optionally at least one independent terminator sequence, sequences 
permitting replication of the cassette in vivo, e.g. . plants, eukaryotes, or prokaryotes, or a 
combination thereof, (e.g. , shuttle vectors) and selection markers for the selected 
expression system, e.g., plant, prokaryotic or eukaryotic systems. To ensure proper 
polypesptide expression under varying conditions, a polyadenylation region at the 3'-end of 
the coding region can be included (see Li (1997) Plartt Physiol.n5:32U325, for a review 
of the polyadenylation of RNA in plants). The polyadenylation region can be derived from 
the natural gene, from a variety of other plant genes, or from T-DNA {e.g., using 
Agrobacterium tumefaciens T-DNA replacement veaors. see e.g., Thykjaer (1997) Plant 
Mol Biol. 35:523-530; using a plasmid containing a gene of interest flanked by 
Agrobacterium T-DNA border repeat sequences; Hansen (1997) "T-strand integration in 
maize protoplasts after codeUvery of a T-DNA substrate and virulence genes." Proc. Natl. 
Acad. Set. USA 94: 1 1726-11730. 

To klentify the promoters, the 5' portions of the clones described here are 
analyzed for sequences characteristic of promoter sequences. For instance, promoter 
sequence elements include the TATA box consensus sequence (JATAAT). which is 
usually 20 to 30 base pairs upstream of the transcription start site. In plants, fiirther 



wo 98/30083 PCT/US98AM)615 

13 

upstream from the TATA box, at positions -80 to -100, there is typically a promoter 
element with a series of adenines surrounding the trinucleotide G (or T) N G (see, e.g.. 
Messing, in Genetic Engineering in Plants, pp. 221-227, Kosage, Meredith and 
HoUaender, eds. 1983). If proper polypeptide expression is desired, a polyadenylation 
region at the 3 '-end of the RG coding region should be included. The polyadenylation 
region can be derived from the natural gene, from a variety of other plant genes, or from 
viral genes, such as T-DNA. 

The nucleic acids of the invention can be expressed in expression cassettes, 
vectors or viruses which are transiently expressed in cells using, for example, episomal 
expression systems {e.g., cauliflower mosaic virus (CaMV) viral RNA is generated in the 
nucleus by transcription of an episomal minichromosome containing supercoiled DNA, 
Covey (1990) Proc. Natl Acad, ScL USA 87:1633-1637). Alternatively, coding sequences 
can be inserted into the host cell genome becoming an integral part of the host 
chromosomal DNA. 

Selection markers can be incorporated into expression cassettes and vectors 
to confer a selectable phenotype on transformed cells and sequences coding for episomal 
maintenance and replication such that integration into the host genome is not required. For 
example, the marker may encode biocide resistance, such as antibiotic resistance, 
particularly resistance to chloramphenicol, kanamycin, G418, bleomycin, hygromycin, or 
herbicide resistance, such as resistance to chlorosulfuron or Basta, to permit selection of 
those cells transformed with the desired DNA sequences, see for example, 
Blondelet-Rouault (1997) Gene 190:315-317; Aubrecht (1997) J. Pharmacol. Exp. Ther. 
281:992-997 . Because selectable marker genes conferring resistance to substrates like 
neomycin or hygromycin can only be utilized in tissue culture, chemoresistance genes are 
also used as selectable markers in vitro and in vivo. See also, Mengiste (1997) 
"High-efficiency transformation of Arabidopsis thaliana with a selectable marker gene 
regulated by the T-DNA V promoter," Plant J. 12:945-948, showing that the 1* promoter 
is an attractive alternative to the cauliflower mosaic virus (CaMV) 35S promoter for the 
generation of T-DNA insertion lines, the V promoter may be especially beneficial for the 
secondary u^nsformation of transgenic strains containing the 35S promoter to exclude 
homology-mediated gene silencing. 
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The endogenous promoters from the RG genes of the present invention can - 
be used to direct expression of the genes. These promoters can also be used to direct 
expression of heterologous structoral genes. The promoters can be used, for example, in 
recombinant expression cassettes to drive expression of genes conferring resistance to any 
5 number of pathogens or pests, including fiingi. bacteria, and the like. 
Constitutive Promoters 

In construction of recombinant expression cassettes, vectors, transgenics, of 
the invention, a promoter fragment can be employed to direct expression of the desired 
gene in all tissues of a plant or animal. Promoters that drive expression continuously 
10 under physiological conditions are referred to as "constitutive" promoters and are active 
under most enviromnental conditions and states of development or cell differentiation. 
Examples of constitotive promoters include those from viruses which infect plants, such as 
the cauliflower mosaic virus (CaMV) 35S transcription initiation region; the 1'- or 2'- 
promoter derived from T-DNA of Agrobacterium tumafaciens; tiie promoter of the tobacco 
15 mosaic virus; and. other transcription initiation regions from various plant genes known to 
tiiose of skill. See also Holtorf (1995) "Comparison of different constitutive and inducible 
promoters for the overexpression of transgenes mArabidopsis thaHana," Plant Mol. Biol. 

29:637-646. 

Inducible Promoters 

20 Alternatively, a plant promoter may direct expression of the plant disease 

resistance mxcleic acid of the invention under tiie influence of changing enviromnental 
conditions or developmental conditions. Examples of enviromnental conditions that may 
effect transcription by inducible promoters include pathogenic attack, anaerobic conditions, 
elevated temperature, drought, or the presence of light. Such promoters are refened to 
25 herein as "inducible" promoters. For example, the invention incorporates the drought- 
inducible promoter of maize (Busk (1997) supra); the cold, drought, and high salt 
inducfcle promoter from potato (Kirch (1997) Plant Mol. Biol. 33:897-909). 

Embodunents of flie invention also incorporate use of plant promoters which 
are inducible upon injury or mfection to express tiie invention's plant disease resistance 
30 (RG ) polypeptides. Various embodiments include use of. e.g., the promoter for a tobacco 
(Nicotiam tabacum) sesquiterpene cyclase gene (EAS4 promoter), which is expressed in 
wounded leafs, roots, and stem tissues, and upon infection witii microbial pathogens (Yin 
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(1997) Plant Physiol. 1 15(2):437-451); the 0RF13 promoter from Agrobacterium 
rhizogenes 8196, which is wound inducible in a limited area adjacent to the wound site 
(Hansen (1997) MoL Gen, Genet. 254:337-343); the Shpx6b gene promoter, which is a 
plant peroxidase gene promoter induced by microbial pathogens (demonstrated using a 
5 fungal pathogen, see Curtis (1997) MoL Plant Microbe Interact. 10:326-338); the 

woimd-inducible gene promoter wunl, derived from potato (Siebertz (1989) Plant Cell 
1:961-968); the ^oundAnducihle Agrobacterium pmas gene (mannopine synthesis gene) 
promoter (Guevara-Garcia (1993) Plant J. 4:495-505). 

Alternatively, plant promoters which are inducible upon exposure to plant 

10 hormones, such as auxins, are used to express the nucleic acids of the invention. For 
example, the invention can use the auxin-response elements El promoter fragment 
(AuxREs) in the soybean (Glycine max L.) (Liu (1997) Plant Physiol. 115:397-407); the 
auxin-responsive Arabidopsis GST6 promoter (also responsive to salicylic acid and 
hydrogen peroxide) (Chen (1996) Plant J. 10: 955-966); the auxin-inducible parC 

15 promoter from tobacco (Sakai (1996) 37:906-913); a plant biotm response element (Streit 
(1997) MoL Plant Microbe Interact. 10:933-937); and, the promoter responsive to the 
stress hormone abscisic acid (Sheen (1996) Science 274:1900-1902). 

Plant promoters which are inducible upon exposure to chemicals reagents 
which can be applied to the plant, such as herbicides or antibiotics, are also used to express 

20 the nucleic acids of the invention. For example, the maize In2-2 promoter, activated by 
benzenesulfonamide herbicide safeners, can be used (De Veylder (1997) Plant Cell 
Physiol. 38:568-577); application of different herbicide safeners induces distinct gene 
expression patterns, includmg expression in the root, hydathodes, and the shoot apical 
meristem. Coding sequence can be under the control of, e.g., a tetracycline-inducible 

25 promoter, e.g., as described with transgenic tobacco plants containmg the Avena sativa L. 
(oat) arginine decarboxylase gene (Masgrau (1997) Plant J, 11:465-473); or, a salicylic 
acid-responsive element (Stange {1997) Plant J. 11:1315-1324. Using chemically- (e.g., 
hormone- or pesticide-) induced promoters, harvesting of fruits and plant parts would be 
greatly facilitated. A chemical which can be applied to the transgenic plant in the field and 

30 induce expression of a polypeptide of the invention throughout all or most of the plant 
would make a environmentally safe defoliant or herbicide. Thus, the invention also 
pro\ndes for transgenic plants containing an inducible gene encoding for the RG 



I 



PCT/US9W00615 

WO98«0083 ig 

polypeptides of the invention whose host range is limited to target plant species, such as . 
weeds or crops before, during or after harvestmg. 

Abcission promoters are activated upon plant ripening, such as fruit 
ripening, and are especially usefiil uicorporated in the expression systems {e.g., expression 
cassettes, vectors) of the invention. In some embodiments, when a plant disease resistant 
polypeptide-encoding nucleic acid is under the control of such a promoter, rapid ceU death, 
induced by expression of the invention's polypeptide, can accelerate and/or accentuate 
abcission, increasing the efficiency of the harvestmg of fruits or other plant parts, such as 
cotton, and the like. Induction of rapid cell death at this time would accelerate separation 
of the fruit from the plant, greaUy augmenting harvesting procedures. See, e.g., Kalaitzis 
(1997) Plant Physiol. 113:1303-1308, discussing tomato leaf and flower abscission; Payton 
(1996) Plant Mol. Biol. 31:1227-1231, discussing ethylene receptor expression regulation 
during fruit ripening, flower senescence and abscission; Koehler (1996) Plant Mol. Biol. 
31:595-606, discussing the gene promoter for a bean abscission cellulase; Kalaitzis (1995) 
Plant Mol. Biol.1%: 647-656, discussing cloning of a tomato polygalacturonase expressed 
in abscission; del Campiilo (1996) Plant Physiol. 111:813-820, discussing pedicel 
breakstrength and cellulase gene expression during tomato flower abscission. 
Tissue-Specific Promoters 

Tissue specific promoters are transcriptional control elements that are only 
active in particular cells or tissues. Pkmt promoters which are active only in specific 
tissues or at specific times during plant development are used to express the nucleic acids 
of the invention. Examples of promoters under developmental control include promoters 
tiiat initiate transcription only in certain tissues, such as leaves, roots, fruit, seeds, ovules, 
pollen, pistols, or flowers. Such promoters are referred to as "tissue specific". The 
operation of a promoter may also vary depending on its location in the genome. Thus, an 
inducible promoter may become fully or partiaUy constitutive in certain locations. 

For example, a seed-specific promoter directs expression in seed tissues. 
Such promoters may be, for example, ovule-specific, embryo-specific, endosperm-specific, 
inteeument-specific, seed coat-specific, or some combination thereof. A leaf-specific 
promoter has been identified in maize. Busk (1997) PUwt J. 11:1285-1295. The 0RF13 
promoter from Agrobacterium rhizogenes exhibits high activity in roots (Hansen (1997) 
supra). A maize pollen-specific promoter has been identified m maize (Guerrero (1990) 



wo 98/30083 ^ ^ PCT/US98/00615 

17 

MoL Gen, Genet. 224:161-168). A tomato promoter active during fruit ripening, 
senescence and abscission of leaves and, to a lesser extent, of flowers can be used (Blume 
(1997) Plant /. 12:731-746). A pistol specific promoter has been identified in the potato 
{Solanum tuberosum L.) SK2 gene, encoding a pistil-specific basic endochitinase (Ficker 
5 (1997) Plant Mol Biol. 35:425-431). The Blec4 gene flrom pea (Pisum sativum cv. 
Alaska) is active in epidermal tissue of vegetative and floral shoot apices of transgenic 
alfalfa, making it a useful tool to target the expression of foreign genes to the epidermal 
layer of actively growing shoots. The activity of the Blec4 promoter in the epidermis of the 
shoot apex makes it particularly suitable for genetically engineering defense against insects 
10 and diseases that attack the growing shoot apex (Mandaci (1997) Plant Mol Biol. 
34:961-965). 

The invention also provides for use of tissue-specific plant promoters 
include a promoter from the ovule-specific BELl gene described in Reiser (1995) Cell 
83:735-742, GenBank No. U39944. Suitable seed specific promoters are derived from the 

15 foUowmg genes: MACl from maize, Sheridan (1996) Genetics 142:1009-1020; Cat3 from 
maize, GenBank No. L05934. Abler (1993) Plant Mol Biol 22:10131-1038; the gene 
encoding oleosin 18kD from maize, GenBank No. J05212, Lee (1994) Plant Mol Biol 
26:1981-1987; vivparous-1 from Arabidopsis, GenbankNo. U93215; the gene encoding 
oleosin from Arabidopsis, Genbank No. Z17657; Atmycl from Arabidopsis, Urao (1996) 

20 Plant Mol Biol 32:571-576; the 2s seed storage protein gene family from Arabidopsis, 
Conceicao (1994) Plant 5:493-505; the gene encoding oleosin 20kD from Brassica napus, 
GenBank No. M63985; napA from Brassica napus, GenBank No. J02798, Josefsson 
(1987) JBL 26:12196-1301; the napin gene family from Brassica napus, Sjodahl (1995) 
Planta 197:264-271; the gene encodmg the 2S storage protein from Brassica napus, 

25 Dasgupta (1993) Gene 133:301-302; the genes encoding oleosin a, Genbank No. U09118, 
and, oleosin B, Genbank No. U09119, from soybean; and, the gene encoding low 
molecular weight sulphur rich protein from soybean, Choi (1995) Mol Gen, Genet. 
246:266-268. The tissue specific E8 promoter from tomato is particularly useful for 
directing gene expression so that a desired gene product is located in fruits. Other suitable 

30 promoters include those from genes encoding embiryonic storage proteins. 

One of skill will recognize that a tissue-specific promoter may drive 
expression of operably linked sequences in tissues other than the target tissue. Thus, as 
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used herein a tissue-specific promoter is one that drives expression preferentially in the . 
target tissue, but may also lead to some expression in other tissues as well. 

The invention also provides for use of tissue-specific promoters derived 
from viruses which can include. e.g., the tobamoviius subgenomic promoter (Kumagai 
(1995) Proc. Natl. Acad. Sd. USA 92:1679-1683; the rice mngro bacaiiform virus 
(RTBV). which replicates only m phloem cells in infected rice plants, with its promoter 
which drives strong phloem-specific reporter gene expression; the cassava vein mosaic 
virus (CVMV) promoter, with highest activity in vascular elements, in leaf mesophyll 
cells, and in root tips (Verdaguer (1996) Plant Mol. Biol. 31:11^9-1139). 

In some embodiments, the nucleic acid construct wUl comprise a promoter 
functional in a specific plant cell, such as in a species of iMCtuca, operably linked to an RG 
polvnucleotide. Promoters useful in these embodiments include RG promoters. In 
additional embodiments, the nucleic acid construct will comprise a RG promoter operably 
linked to a heterologous polynucleotide. The heterologous polynucleotide is chosen to 
provide a plant with a desired phenotype. For example, the heterologous polynucleotide 
can be a strucwral gene which encodes a polypeptide which imparts a desired resistance 
phenotype. Alternatively, the heterologous polynucleotide may be a regulatory gene which 
might play a role in transcriptional and/or uranslational control to suppress, enhance, or 
otherwise modify the transcription and/or expression of an endogenous gene widiin the 
plant. The heterologous polynucleotide of the nucleic acid construct of die present 
invention can be expressed m either sense or anti-sense orientation as desired. It will be 
appreciated that control of gene expression in eitiier sense or anti-sense orientation can 
have a direct impact on the observable plant characteristics. 
Modifying and Inhibiting RG Gene Expression 

The invention also provides for RG nucleic acid sequences which are 
complementary to the RG polypeptide-encoding sequences of tiie invention; i.e., antisense 
RG nucleic acids. Antisense technology can be conveniently used to modify gene 
expression in plants. To accomplish this, a micleic acid segment from the desired gene is 
cloned and operably linked to a promoter such that die anti-sense strand of RNA will be 
transcribed. The construct is then transformed into plants and the antisense strand of RNA 
is produced. In plant cells, it has been shown fliat antisense RNA inhibits gene expression 
by preventing die accumulation of mRNA which encodes die enzyme of interest, see, e.g. . 
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Sheehy (1988) Proc. Nat, Acad. Set USA 85:8805-8809; Hiatt et al., U.S. Patent No. 
4,801,340. 

Antisense sequences are capable of inhibiting the transport, splicing or 
transcription of RG-encoding genes. The inhibition can be effected through the targetmg 

5 . of genomic DNA or messenger RNA. The transcription or function of targeted nucleic 
acid can be inhibited, e.g., by hybridization and/or cleavage. One particularly useful set 
of inhibitors provided by the present invention includes oligonucleotides which are able to 
either bind RG gene or message, in either case preventing or inhibiting the production or 
funcuon of RG. The association can be though sequence specific hybridization. Such 

10 inhibitory nucleic acid sequences can, for example, be used to completely inhibit a plant 
disease resistance response. Another useful class of inhibitors includes oligonucleotides 
which cause inactivation or cleavage of RG message. The oligonucleotide can have 
enz>Tne activity which causes such cleavage, such as ribozymes. The oligonucleotide can 
be chemically modified or conjugated to an enzyme or composition capable of cleaving the 

15 complementary nucleic acid. One may screen a pool of many different such 
oligonucleotides for those with the desired activity. 
Antisense Oligonucleotides 

The invention provides for with antisense oligonucleotides capable of 
binding RG message which can inhibit RG activity by targeting mRNA. Strategies for 

20 designing antisense oligonucleotides are well described in the scientific and patent 
- literamre, and the skilled artisan can design such RG oligonucleotides using the novel 
reagents of the invention. In some situations, naturally occurring nucleic acids used as 
antisense oligonucleotides may need to be relatively long (18 to 40 nucleotides) and present 
at high concentrations. A wide variety of synthetic, non-naturally occurring nucleotide and 

25 nucleic acid analogues are known which can address this potential problem. For example, 
peptide nucleic acids (PNAs) containing non-ionic backbones, such as N-(2-aminoethyl) 
glycine units can be used. Antisense oligonucleotides having phosphorothioate linkages 
can also be used, as described in WO 97/03211; WO 96/39154; Mata (1997) Toxicol Appl 
Phannacol 144:189-197; Antisense Therapeutics, ed. Agrawal (Humana Press, Totowa, 

30 NJ., 1996). Antisense oligonucleotides having synthetic DNA backbone analogues 
provided by the invention can also include phosphoro-dithioate, methylphosphonate. 
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phosphoramidate. alkyl phosphotriester, sulfamate, 3"-(hioacetal. methylene(methylimino). . 
3'-N-carbamate, and morpholino carbamate nucleic acids, as described herein. 

Combinatorial chemistry methodology can be used to create vast numbers of 
oligonucleotides that can be rapidly screened for specific oligonucleotides that have 
appropriate binding affinities and specificities toward any target, such as the sense and 
antisense RG sequences of the invention (for general background mformation. see. e.g.. 
Gold (1995) J. of Biol. Chem. 270: 13581-13584). • 

Inhibitory Ribozymes 

The invention provides for with ribozymes capable of binding RG message 
which can inhibit RG activity by targeting mRNA. Strategies for designing ribozymes and 
selecung the RG-specific antisense sequence for targeting are well described in flie 
scientific and patent literature, and the skilled artisan can design such RG ribozymes using 
tiie novel reagents of the invention. Ribozymes act by binding to a target RNA tiux)Ugh the 
target RNA binding portion of a ribozyme which is held in close proximity to an enzymatic 
portion of die RNA tiiat cleaves die target RNA. Thus, the ribozyme recognizes and binds 
a taraet RNA dirough complementary base-pairing, and once bound to the correct site, acts 
enzy^caUy to cleave and inactivate the target RNA. Cleavage of a target RNA in such a 
manner will destroy its ability to direct symhesis of an encoded protein if the cleavage 
occurs in tiie coding sequence, or, preventing transport of die message from die nucleus to 
the o-toplasm. After a ribozyme has bound and cleaved its RNA target, it is typically 
released from tiiat RNA and so can bind and cleave new targets repeatedly. 

Catalytic RNA molecules or ribozymes can also be used to inhibit 
expression of any plant gene. It is possible to design ribozymes tiiat specifically pair widi 
virtually any target RNA and cleave tiie phosphodiester backbone at a specific location, 
thereby fiinctionally inactivating tiie target RNA. In carrying out diis cleavage, die 
ribozyme is not itself altered, and is tims capable of recycling and cleaving oflier 
moleailes, makmg it a true enzyme. The inclusion of ribozyme sequences wiflun antisense 
RNAs confers RNA-cleaving activity upon tiiem, fliereby increasing tiie activity of die 
constructs. The design and use of target RNA-specific ribozymes is described, e.g., in 
Haseioff (1988) Nature 334:585-591. 

In some circumstances, die enzymatic nature of a ribozyme can be 
advantageous over otiier technologies, such as antisense technology (where a micleic acid 
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molecule simply binds to a nucleic acid target to block its transcription, translation or 
association with another molecule) as the effective concentration of ribozyme necessary to 
effect a therapeutic treatment can be lower than that of an antisense oligonucleotide. This 
potential advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single 
5 ribozyme molecule is able to cleave many molecules of target RNA. In addition, a 
ribozyme is typically a highly specific inhibitor, with the specificity of inhibition 
depending not only on the base pairing mechanism of binding, but also on the mechanism 
by which the molecule inhibits the expression of the RNA to which it binds. That is, the 
inhibition is caused by cleavage of the RNA target and so specificity is defined as the ratio 

10 of the rate of cleavage of the targeted RNA over the rate of cleavage of non-targeted RNA. 
This cleavage mechanism is dependent upon factors additional to those involved in base 
pairing. Thus, the specificity of action of a ribozyme can be greater than that of antisense 
oligonucleotide binding the same RNA site. 

The enzymatic ribozyme RNA molecule can be formed in a hammerhead 

15 motif, but may also be formed in the motif of a hairpin, hepatitis delta virus, group I intron 
or RNaseP-like RNA (in association with an RNA guide sequence). Examples of such 
hammerhead motifs are described by Rossi (1992) Aids Research and Human Retroviruses 
8:183; hairpin motifs by Hampel (1989) Biochemistry 28:4929, and Hampel (1990) Nuc, 
Acids Res. 18:299; the hepatitis delta virus motif by Perrotta (1992) Biochemistry 31:16; 

20 the RNaseP motif by Guerrier-Takada (1983) Cell 35:849; and the group I intron by Cech 
U.S. Pat. No. 4,987,071 . The recitation of these specific motifs is not intended to be 
limiting; those skilled in the art will recognize that an enzymatic RNA molecule of this 
invention has a specific substrate binding site complementary to one or more of the target 
gene RNA regions, and has nucleotide sequence within or surrounding that substrate 

25 binding site which inq>arts an RNA cleaving activity to the molecule. 
Sense Supression 

Another method of suppression is sense suppression. Introduction of 
nucleic acid configured in the sense orientation has been shown to be an effective means by 
which to block the transcription of target genes. For an example of the use of this method 
30 to modulate expression of endogenous genes see, Napoli et al.. The Plant Cell 2:279-289 
(1990), and U.S. Patent No. 5,034,323. 
Cloning ofRG Polypeptides 
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Synthesis and/or cloning of RG polynucleotides and isolated nucleic acid 
constructs of the present invention are provided by methods well known to those of 
ordinary skill in the art. Generally, the nomenclature and the laboratory procedures in 
recombinant DNA technology described below are those weU known and commonly 
employed in the art. Standard techniques are used for cloning, DNA and RNA isohrtion. 
ampUfication and purification. Generally enzymatic reactions involving DNA ligase. DNA 
polymerase, restriction endonucleases and the like are performed according to the 
manufactorer's specifications. These techniques and various other techniques are generally 
performed according to Sambrook et al. , Molecular Clordng - A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, (1989). 

The isolation of RG genes may be accomplished by a number of techniques. 
For instance, oligonucleotide probes based on the sequences disclosed here can be used to 
identify the desired gene in a cDNA or genomic DNA library. To construct genomic 
libraries, large segments of genomic DNA are generated by random fragmentation, e.g. 
using restriction endonucleases. and are ligated with vector DNA to form concatemers that 
can be packaged into the appropriate vector. To prepare a cDNA library, mRNA is 
isolated from the desired organ, such as roots and a cDNA library which contains the RG 
gene transcript is prepared from the mRNA. Alternatively, cDNA may be prepared from 
mRNA extracted from other tissues in which RG genes or homologs are expressed. 

The cDNA or genomic library can then be screened using a probe based 
upon the sequence of a cloned RG gene such as the genes disclosed herein. Probes may be 
used to hybridize with genomic DNA or cDNA sequences to isolate homologous genes in 
the same or different plant species. 

Those of skm in the art will appreciate that various degrees of stringency of 
hybridization can be employed in the assay ; and either the hybridization or the wash 
medram can be stringent. As the conditions for hybridization become more stringent, 
there must be a greater degree of complementarity between the probe and the target for 
duplex formation to occur. The degree of stringency can be controlled by temperature, 
ionic strength, pH and the presence of a partially denatoring solvent such as formamide. 
For example, the stringency of hybridization is conveniendy varied by changing die po- 
larity of die reactant solution dirough manipulation of die concendration of formamide 
within the ranee of 0% to 50%. 
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Alternatively, the RG nucleic acids of the invention can be amplified from . 
nucleic acid samples using a variety of amplification techniques, such as polymerase chain 
reaction (PCR) technology, to amplify the sequences of the RG and related genes directly 
from genomic DNA, from cDNA, from genomic libraries or cDNA libraries. PCR and 
5 other in vitro amplification methods may also be useful, for example, to clone nucleic acid 
sequences that code for proteins to be expressed, to make nucleic acids to use as probes for 
detecting the presence of the desired mRNA in samples, for nucleic acid sequencing, or for 
other piurposes. 

Oligonucleotides can be used to identify and detect additional RG families 

10 and RG family species using a variety of hybridization techniques and conditions. Suitable 
ampliBcation methods include, but are not limited to: polymerase chain reaction, PCR 
(PCR Protocols, A Guide to Methods and Applications, ed, Innis, Academic Press, 
N.Y. (1990) and PCR STRATEGIES (1995), ed, Innis, Academic Press, Inc., N.Y. (Innis 
)), ligase chain reaction (LCR) (Wu (1989) Genomics 4:560; Landegren (1988) Science 

15 241 : 1077; Barringer (1990) Gene 89: 1 17); transcription amplification (Kwoh (1989) Proc. 
Natl. Acad. Sci, USA 86:1173); and, self-sustained sequence replication (Guatelli (1990) 
Proc, Natl, Acad. Sci. USA, 87:1874); Q Beta replicase amplification and other RNA 
polymerase mediated techniques (e.g., NASBA, Cangene, Mississauga, Ontario); see 
Berger (1987) Methods Enzymol. 152:307-316, Sambrook, and Ausubel, as well as Mullis 

20 (1987) U.S. Patent Nos. 4,683.195 and 4,683,202; Amheun (1990) C&EN 36-47; Lomell 
/. Clin. Chem., 35:1826 (1989); Van Brunt, Biotechnology, 8:291-294 (1990); Wu (1989) 
Gene 4:560; Sooknanan (1995) Biotechnology 13:563-564. Methods for cloning in vitro 
amplified nucleic acids are described in Wallace, U.S. Pat. No. 5,426,039. 

The degree of complementarity (sequence identity) required for detectable 

25 binding will vary in accordance with the stringency of the hybridization medium and/or 

wash medium. The degree of complementarity will optimally be 100 percent; however, it 
shoiild be understood that minor sequence variations in the probes and primers may be 
compensated for by reducing the stringency of the hybridization and/or wash medium as 
described earlier. 

30 In some preferred embodiments, members of this class of pest resistance 

genes can be identified by their ability to be amplified by PCR primers based on the 
sequences disclosed here. Appropriate primers and probes for identifying RG sequences 
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from plant tissues are generated from comparisons of the sequences provided herein. See, . 
e.g.. Table 1. For a general overview of PCR see PCR Protocols: A Guide to Methods 
and Applications. (Innis, M» Gelfand, D.» Sninsky, J. and White, T., eds.), Acadendc 
Press, San Diego (1990), incorporated herein by reference. 

5 Briefly, the first step of each cycle of the PCR involves the separation of the 

nucleic acid duplex formed by the primer extension. Once the strands are separated, the 
next step in PCR involves hybridizmg the separated strands with primers that flank the 
target sequence. The primers are then extended to form complementary copies of the 
target strands. For successful PCR amplification, the primers are designed so that the 

10 position at which each primer hybridizes along a duplex sequence is such that an extension 
product synthesized from one primer, when separated from the template (complement), 
serves as a template for the extension of the other primer. The cycle of denaturation, 
hybridization, and extension is repeated as many tunes as necessary to obtain the desired 
amount of amplified nucleic acid. 

15 In the preferred embodiment of the PCR process, strand separation is 

achieved by heating the reaction to a sufficiently high temperature for an sufficient time to 
cause the denaturation of the duplex but not to cause an irreversible denaturation of the 
polymerase (see U.S. Patent No. 4,965,188). Template-dependent extension of primers in 
PCR is catalyzed by a polymerizing agent in the presence of adequate amounts of four 

20 deoxjTibonucleotide triphosphates (typically dATP. dGTP. dCTP. and dTTP) in a reaction 
medium comprised of the appropriate salts, metal cations, and pH buffering system. 
Suitable polymerizing agents are enzymes known to catalyze template-dependent DNA 
synthesis. 

Polynucleotides may also be synthesized by well-known techniques as 
25 described m the technical literature. See, e.g., Carruthers et al.. Cold Spring Harbor 
Symp. Quant. Biol 47:41 M18 (1982), and Adams et al., J. Am. Chem. Soc. 105:661 
(1983). Double stranded DNA fragments may then be obtained either by synthesizing the 
complementary strand and annealing the strands together under appropriate conditions, or 
by adding the complementary strand using DNA polymerase with an appropriate primer 
30 sequence. 



RG Proteins 



wo 98/30083 PCT/US98/00615 

The present invention further provides isolated RG proteins encoded by the . 
RG polynucleotides disclosed herein. One of skill will recognize that the nucleic acid 
encoding a functional RG protein need not have a sequence identical to the exemplified 
genes disclosed here. For example, because of codon degeneracy a large number of 

S nucleic acid sequences can encode the same polypeptide. In addition, the polypeptides 
encoded by the RG genes, like other proteins, have different domains which perform 
different functions. Thus, the RG gene sequences need not be full length, so long as the 
desired functional domain of the protein is expressed. 

The resistance proteins are at least 25 amino acid residues in length. 

10 Typically, the RG proteins are at least 50 amino acid residues, generally at least 100, 

preferably at least 150, more preferably at least 200 amino acids in lenjgth. In particularly 
preferred embodiments, the RG proteins are of sufficient length to provide resistance to 
pests when expressed in the desired plants. Generally then, the RG proteins will be the 
length encoded by an RG gene of the present invention. However, those of ordinary skill 

IS will appreciate that minor deletions, substitutions, or additions to an RG protein will 

typically yield a protein with pest resistance characteristics similar or identical to that of 
the full length sequence. Thus, full-length RG proteins modified by 1, 2, 3, 4, or 5 
deletions, substitutions, or additions, generally provide an effective degree of pest 
resistance relative to the full-length protein. 

20 The RG proteins which provide pest resistance will typically comprise at 

least one of an LRR or an NBS. Preferably, both are present. LRR and/or NBS regions 
present in the RG proteins of the present invention can be provided by RG genes of the 
present invention. In some embodiments, the LRR and/or NBS regions are obtained from 
other pest resistance genes. See, e.g., Yu et aL, Proc. Natl. Acad. ScL USA, 93: 11751- 

25 11756 (1996); Bent et al , Science, 265: 1856-1860 (1994). 

Modified protein chains can also be readily designed utilizing various 
recombinant DNA techniques well known to those skilled in the art. For example, the 
chains can vary from the naturally occurring sequence at the primary structure level by 
amino acid substitutions, additions, deletions, and the like. Modification can also include 

30 swapping domains from the proteins of the invention with related domains from other pest 
resistance genes. 
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Pests that can be targeted by RG genes and proteins of the present invention . 
include such bacterial pests as Erwinia carotovora and Pseudomonas marginalis. Fungal 
pests which can be targeted by the present invention include Bremia laaucae, Marssonina 
parumoniana, Rhizoctonia solani, Olpidium brassicae, root aphid, Sclerotinia 
5 sclerotiorum and 5. minor, and Botrytis cinerea which causes gray mold. RG genes also 
provide resistance to viral diseases such as lettuce and turnip mosaic viruses. 
Fusion Proteins 

RG polypeptides can also be expressed as recombinant proteins with one or 
more additional polypeptide domains linked thereto to facilitate protein detection, 

10 purification, or other applications. Such detection and purification facilitating domams 
include, but are not limited to, metal chelating peptides such as polyhistidine tracts and 
histidine-tryptophan modules that allow purification on immobilized metals, protein a 
domains that allow purification on immobilized immunoglobulin, and the domain utilized 
in the FLAGS extension/affinity purification system (Immunex Corp, Seattle WA). The 

15 inclusion of a cleavable linker sequences such as Factor Xa or enterokinase (Invitrogen, 
San Diego CA) between the purification domain and plant disease resistant polypeptide 
may be useful to facilitate purification. One such expression vector provides for 
expression of a fusion protein comprising the sequence encoding a plant disease resistant 
polypeptide of the invention and nucleic acid sequence encoding six histidine residues 

20 followed by thioredoxin and an enterokinase cleavage site (e.g., see Williams (1995) 

Biochemistry 34:1787-1797). The histidine residues facilitate detection and purification 
while the enterokinase cleavage site provides a means for purifying the desired protein(s) 
from the remainder of the fusion protein. Technology pertaining to vectors encoding 
fusion protems and application of fusion proteins are well described, see e.g., KroU 

25 (1993) DNA Cell Biol, 12:441-53. 

Antibodies Reactive to RG Polypeptides and Immunological Assays 

The present invention also provides antibodies which specifically react with 
RG proteins of the present invention under immunologically reactive conditions. An 
antibody immunologically reactive with a particular antigen can be generated in vivo or by 

30 recombinant methods such as selection of libraries of recombinant antibodies in phage or 
similar vectors, "hnmunologically reactive conditions" includes reference to conditions 
which allow an antibody, generated to a particular epitope of an antigen, to bind to that 
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epitope to a detectably greater degree than the antibody binds to substantially all other 
epitopes, generally at least two times above background binding, preferably at least five 
times above background. Immunologically reactive conditions are dependent upon the 
format of the antibody binding reaction and Qrpically are those utilized in immunoassay 
5 protocols. 

"Antibody" mcludes reference to an immunoglobulin molecule obtained by 
in vitro or in vivo generation of the humoral response, and includes both polyclonal and 
monoclonal antibodies. The term also includes genetically engineered forms such as 
chimeric antibodies (e.g., humanized murine antibodies), heteroconjugate antibodies (e.g., 

10 bispecific antibodies), and recombinant single chain Fv fragments (scFv). The term 

"antibody" also includes antigen binding forms of antibodies (e.g.. Fab', F(ab*)2, Fab, Fv, 
rIgG, and, inverted IgG). See, Pierce Catalog and Handbook, 1994-1995 (Pierce 
Chemical Co., Rockford, IL). An antibody immunologically reactive with a particular 
antigen can be generated in vivo or by recombinant methods such as selection of libraries 

15 of recombinant antibodies in phage or similar vectors. See, e.g., Huse et al. (1989) 

Science 246:1275-1281; and Ward, et al (1989) Nature 341:544-546; and Vaughan et al 
{1^6) Nature Biotechnology, 14:309-314. 

Many methods of making antibodies are known to persons of skill. A 
number of immunogens are used to produce antibodies specifically reactive to an isolated 

20 RG protein of the present invention under immunologically reactive conditions. An 
isolated recombinant, synthetic, or native RG protein of the present invention is the 
preferred immunogens (antigen) for the production of monoclonal or polyclonal antibodies. 

The RG protein is then injected into an animal capable of producing 
antibodies. Either monoclonal or polyclonal antibodies can be generated for subsequent 

25 use in immunoassays to measure the presence and quantity of the RG protein. Methods of 
producing monoclonal or polyclonal antibodies are known to those of skill in the art. See, 
e.g., Coligan (1991) Current Protocols in Immunology Wiley/Greene, NY; and Harlow 
and Lane (1989) Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY); 
Coding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, 

30 New York, NY. 

Frequently, the RG proteins and antibodies will be labeled by joining, either 
covalently or non-covalently, a substance which provides for a detectable signal. A wide 
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variety of labels and conjugation techniques are known and are reported extensively in both 
the scientific and patent literature. Suitable labels include radionucleotides, enzymes, 
substrates, cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, 
magnetic particles, and the like. Patents teachmg the use of such labels include U.S. 
5 Patent Nos. 3,817.837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 
4,366,241. 

The antibodies of the present invention can be used to screen plants for the 
expression of RG proteins of the present invention. The antibodies of this mvention are 
also used for affinity chromatography in isolating RG protein. 

10 The present invention finrther provides RG polypeptides that specifically 

bind, under immunologically reactive conditions, to an antibody generated against a 
defined immunogen, such as an inununogen consisting of the RG polypeptides of the 
present invention. Immunogens will generally be at least 10 contiguous amino acids from 
an RG polypeptide of the present invention. Optionally, immunogens can be from regions 

15 exclusive of the NBS and/or LRR regions of the RG polypeptides. Nucleic acids which 
encode such cross-reactive RG polypeptides are also provided by the present invention. 
The RG polypeptides can be isolated from any number plants as discussed earlier. 
Preferred are species from the family Compositae and m particular the genus Lactuca such 
as L. sativa and such subspecies as crispa, longifolia, and asparagina. 

20 "Specifically binds" includes reference to the preferential association of a 

ligand, in whole or part, with a particular target molecule (i.e., "binding parmer" or 
"binding moiety") relative to compositions lacking that target molecule. It is, of course, 
recognized that a certain degree of non-specific interaction may occur between a ligand and 
a non-target molecule. Nevertheless, specific binding, may be distinguished as mediated 

25 through specific recognition of the target molecule. Typically specific binding results in a 
much stronger association between the ligand and the target molecule than between the 
ligand and non-target molecule. Specific binding by an antibody to a protein under such 
conditions requires an antibody that is selected for its specificity for a particular protein. 
The affinity constant of the antibody binding site for its cognate monovalent antigen is at 

30 least 10^, usually at least Itf , preferably at least itf , more preferably at least 10^^, and 
mosi preferably at least 10*^ liters/mole. A variety of immunoassay formats are 
appropriate for selecting antibodies specifically reactive with a particular protein. For 
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example, solid-phase ELISA immunoassays are routinely used to select monoclonal 
antibodies specifically reactive with a protein. See Harlow and Lane (1988) Antibodies, A 
Laboratory Manual, Cold Spring Harbor Publications, New York, for a description of 
immunoassay formats and conditions that can be used to determine specific reactivity. The 

5 antibody may be polyclonal but preferably is monoclonal. Generally, antibodies cross- 
reactive to such proteins as KPS2, RPMl (bacterial resistances in Arabidopsis, L6 (fungal 
resistance in flax, PRF (resistance to Pseudomonas syringae in tomator), and N, (virus 
resistance in tobacco), are removed by immunoabsorbtion. 

Inmnmoassays in the competitive binding format are typically used for 

10 cross-reactivity determinations. For example, an immunogenic RG polypeptide is 

immobilized to a solid support. Polypeptides added to the assay compete with the binding 
of the antisera to the immobilized antigen. The ability of the above polypeptides to 
compete with the binding of the antisera to the immobilized RG polypeptide is compared to 
the immunogenic RG polypeptide. The percent cross-reactivity for the above proteins is 

15 calculated, using standard calculations. Those antisera with less than 10% cross-reactivity 
with such proteins as RPS2, RPMl, L6, PRF, and N, are selected and pooled. The cross- 
reacting antibodies are then removed from the pooled antisera by immunoabsorbtion with 
these non-RG resistance proteins. 

The immunoabsorbed and pooled antisera are then used in a competitive 

20 binding immunoassay to compare a second "target" polypeptide to the unmunogenic 

polypeptide. In order to make this comparison, the two polypeptides are each assayed at a 
wide range of concentrations and the amount of each polypeptide required to inhibit 50% 
of the binding of the antisera to the immobilized protein is determined using standard 
techniques. If the amount of the target polypeptide required is less than twice the amount 

25 of the immunogenic polypeptide that is required, then the target polypeptide is said to 
specifically bind to an antibody generated to the immunogenic protein. As a final 
determination of specificity, the pooled antisera is fully immunosorbed with the 
immunogenic polypeptide until no bindmg to the polypeptide used in the immunosorbtion 
is detectable. The fully immunosorbed antisera is then tested for reactivity with the test 

30 polypeptide. If no reactivity is observed, then the test polypeptide is specifically bound by 
the antisera elicited by the immunogenic protein. 
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Production o f transpenic plants of the invention 

Isolated nucleic acid constructs prepared as described herein can be 

introduced into plants according techniques known in the art. In some embodiments, the 

introduced nucleic acid is used to provide RG gene expression and therefore pest resistance 
5 in desired plants. In some embodiments, RG promoters are used to drive expression of 

desired heterologous genes in plants. Finally, in some embodiments, the constructs can be 

used to suppress expression of a target endogenous gene, including RG genes. 

To use isolated RG sequences in the above techniques, recombinant DNA 

vectors suitable for transformation of plant cells are prepared. Techniques for 
10 transforming a wide variety of higher plant species are well known and described in the 

technical and scientific literature. See, for example, Weising et al Am. Rev. Genet. 

22:421-477 (1988). 

A DNA sequence coding for the desired RG polypeptide, for example a 

cDNA or a genomic sequence encoding a full length protein, will be used to construct a 
15 recombinant expression cassette which can be introduced into the desired plant. An 

expression cassette will typically comprise the RG polynucleotide operably linked to 

transcriptional and translational initiation regulatory sequences which will direct the 

transcription of the sequence from the RG gene m the intended tissues of the transformed 

plant. 

20 Such DNA constructs may be introduced into the genome of the desired 

plant host by a variety of conventional techniques. For example, the DNA construct may 
be introduced directly into the genomic DNA of the plant cell using techniques such as 
electroporation, PEG poration, particle bombardment and microinjection of plant cell 
protoplasts or embryogenic callus, or the DNA constructs can be introduced directly to 

25 plant tissue using ballistic methods, such as DNA particle bombardment. Alternatively, 
the DNA constiucts may be combined with suitable T-DNA flanking regions and 
introduced into a conventional Agrobacterium tumefaciens host vector. The virulence 
functions of the Agrobacterium tumefaciens host will direct the insertion of the construct 
and adjacent marker into the plant cell DNA when the cell is infected by the bacteria. 

30 Transformation techniques are known in the art and well described in the 

scientific and patent literature. The introduction of DNA constructs using polyethylene 
glycol precipitation is described in Paszkowski et al. Embo J. 3:2717-2722 (1984). 
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Electroporation techniques are described in Fronun et al. Proc. Natl. Acad. ScL USA 
82:5824 (1985). Ballistic transformation techniques are described in Klein et ai Nature 
327:70-73 (1987). 

Agrobacterium tumefacienS'medii2Ltsd transformation techniques are well 
5 described in the scientific literature. See, for example Horsch et al. Science 233:496-498 
(1984), and Fraley et al. Proc. Natl. Acad. Sci. USA 80:4803 (1983). Although 
Agrobacterium is useful primarily in dicots, certain monocots can be transformed by 
Agrobacterium. For instance, Agrobacterium transformation of rice is described by Hiei et 
al. Plant J. 6:271-282 (1994). A particularly preferred means of transforming lettuce is 

10 described in Michehnore et al. Plant Cell Reports^ 6:439-442 (1987). 

Transformed plant cells which are derived by any of the above 
transformation techniques can be cultured to regenerate a whole plant which possesses the 
transformed genotype and thus the desired RG-controUed phenotype. Such regeneration 
techniques rely on manipulation of certain phytohormones in a tissue culture growth 

15 medium, typically relying on a biocide and/or herbicide marker which has been introduced 
together with the RG nucleotide sequences. Plant regeneration from cultured protoplasts is 
described in Evans et al.. Protoplasts Isolation and Culture, Handbook of Plant Cell 
Culture, pp. 124-176, Macmillilan Publishing Company, New York, 1983; and Binding, 
Regeneration of Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, 1985. 

20 Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. 
Such regeneration techniques are described generally in Klee et al. Ann. Rev. of Plant 
Phys. 38:467-486 (1987). 

The methods of the present invention are particularly useful for 
incorporating the RG polynucleotides mto transformed plants in ways and under 

25 circumstances which are not found naturally. In particular, the RG polypeptides may be 
expressed at times or in quantities which are not characteristic of natural plants. 

One of skiU will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be introduced into 
other plants by sexual crossing. Any of a number of standard breeding techniques can be 

30 used, depending upon the species to be crossed. 



Detecrion of RG Resistance Genes 
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The present invention further provides methods for detecting RG resistance - 
genes in a nucleic acid sample suspected of conq)rising an RG resistance gene. The means 
by which the RG resistance gene is detected is not a critical aspect of the invention. For 
example, RG resistance genes can be detected by the presence of amplicons using RG 

S resistance gene specific primers. Additionally, RG resistance genes can be detected by 
assaying for specific hybridization of an RG polynucleotide to an RG resistance gene. In 
some embodiments, the RG resistance gene can be amplified prior to the step of contactmg 
the nucleic acid sample with the RG polynucleotide. 

In a typical detection method, the nucleic acid sample is contacted with an 

10 RG polynucleotide to form a hybridization complex. The hybridization complex may be 
detected directly (e.g., in Southern or northern blots), or indirectly (e.g,, by subsequent 
primer extension during PGR amplification). The RG polynucleotide hybridizes under 
stringent conditions to an RG polynucleotide of the invention. Formation of the 
hybridization complex is directly or indirectly used to indicate the presence of the RG 

IS resistance gene in the nucleic acid sample. 

Detection of the hybridization complex can be achieved using any number of 
well known methods. For example, the nucleic acid sample, or a portion thereof, may be 
assayed by hybridization formats includmg but not lunited to, solution phase, solid phase, 
mixed phase, or in situ hybridization assays. Briefly, in solution (or liquid) phase 

20 hybridizations, both the target nucleic acid and the probe or primer are free to interact in 
the reaction mixture. In solid phase hybridization assays, probes or primers are typically 
linked to a solid support where they are available for hybridization with target nucleic in 
solution. In mixed phase, nucleic acid intermediates in solution hybridize to target nucleic 
acids in solution as well as to a nucleic acid linked to a solid support. In in situ 

25 hybridization, the target nucleic acid is liberated fi-om its cellular surroundings in such as 
to be available for hybridization within the cell while preserving the cellular morphology 
for subsequent interpretation and analysis. The following articles provide an overview of 
the various hybridization assay formats: Singer et al, Biotechniques 4f5):230-250 (1986); 
Haase et a/.. Methods in Virology, Vol. VE, pp. 189-226 (1984); Wilkinson, "The theory 

30 and practice of in situ hybridization" In: In situ Hybridization, Ed. D.G. Wilkinson. IRL 
Press, Oxford University Press, Oxford; and Nucleic Acid Hybridization: A Practical 
Approach, Ed. Hames, B.D. and Higgins, S.J., IRL Press (1987). 
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The effect of the modification of RG gene expression can be measured by . 
detection of increases or decreases in mRNA levels using, for instance. Northern blots. In 
addition, the phenotypic effects of gene expression can be detected by measuring 
nematode, fungal, bacterial, viral, or other pest resistance in plants. Suitable assays for 

5 determining pest resistance are well known. Michelmore and Crate, Trans. Br. mycoL 
Soc, 79(3): 542-546 (1982). 

The means by which hybridization complexes are detected is not a critical 
aspect of the present invention and can be accomplished by any number of methods 
currently known or later developed. RG polynucleotides can be labeled by any one of 

10 several methods typically used to detect the presence of hybridized nucleic acids. One 
common method of detection is the use of autoradiography using probes labeled with^H, 
^^I, -*-S. **C, or ^^P, or the like. The choice of radioactive isotope depends on research 
preferences due to ease of synthesis, stability, and half lives of the selected isotopes. 
Other labels include ligands which bind to antibodies labeled with fluorophores, 

15 chemilummescent agents, and enzymes. Alternatively, probes can be conjugated directly 
with labels such as fluorophores, chemiluminescent agents or enzymes. The choice of 
label depends on sensitivity required, ease of conjugation with the probe, stability 
requirements, and available instrumentation. Labeling the RG polynucleotide is readily 
achieved such as by the use of labeled PGR primers. 

20 The choice of label dictates the manner in which the label is bound to the 

probe. Radioactive probes are typically made using commercially available nucleotides 
containing the desired radioactive isotope. The radioactive nucleotides can be incorporated 
into probes, for example, by using DNA synthesizers, by nick translation with DNA 
polymerase I, by tailing radioactive DNA bases to the 3* end of probes with terminal 

25 deoxynucleotidyl transferase, by treating single-stranded M13 plasmids having specific 
inserts with the Klenow fragment of DNA polymerase in the presence of radioactive 
deoxy nucleotides, dNTP, by transcribing from RNA templates using reverse transcriptase 
in the presence of radioactive deoxynucleotides, dNTP, or by transcribing RNA from 
vectors containing specific RNA viral promoters (e.g., SP6 promoter) using the 

30 corresponding RNA polymerase (e.g., SP6 RNA polymerase) in the presence of 
radioactive ribonucleotides rNTP. 
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The probes can be labeled using radioactive nucleotides in which the isotope 
resides as a part of the nucleotide molecule, or in which the radioactive component is 
attached to the nucleotide via a terminal hydroxyl group that has been esterified to a 
radioactive component such as inorganic acids, e.g., 32P phosphate or 14C organic acids, 

5 or esterified to provide a linking group to the label. Base analogs having nucleophilic 
linking groups, such as primary amino groups, can also be linked to a label. 

Non-radioactive probes are often labeled by indirect means. For example, a 
ligand molecule is covalently bound to the probe. The ligand then binds to an anti-ligand 
molecule which is either inherently detectable or covalently bound to a detectable signal 

10 system, such as an enzyme, a fluorophore, or a chemiluminescent compound. Enzymes of 
interest as labels will primarily be hydrolases, such as phosphatases, esterases and glyco- 
sidases, or oxidoreductases, particularly peroxidases. Fluorescent compounds include 
fluorescein and its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. 
Chemiluminescers include luciferin, and 2,3-dihydrophthalazinediones, e.g., lummol. 

15 Ligands and anti-ligands may be varied widely. Where a ligand has a natural anti-ligand, 
namely ligands such as biotin, thyroxine, and Cortisol, it can be used in conjunction with 
its labeled, naturally occurring anti-ligands. Alternatively, any haptenic or antigenic 
compound can be used in combination with an antibody. 

Probes can also be labeled by du-ect conjugation with a label. For example, 

20 cloned DNA probes have been coupled directly to horseradish peroxidase or alkaline 
phosphatase, (Renz. M., and Kurz, K. (1984) A Colorimetric Method for DNA 
Hybridization. NucL Acids Res. 12: 3435-3444) and synthetic oligonucleotides have been 
coupled directly with alkalme phosphatase (Jablonski, E., et al. (1986) Preparation of 
Oligodeoxynucleotide-Alkaline Phosphatase Conjugates and Their Use as Hybridization 

25 Probes. Nuc, Acids. Res. 14: 6115-6128; and Li P., et al. (1987) Enzyme-linked Synthetic 
Oligonucleotide probes: Non-Radioactive Detection of Enterotoxigenic Escherichia Coli 
m Faeca Specimens. Nucl. Acids Res. 15:5275-5287). 

Definitions 

30 Units, prefixes, and symbols can be denoted in their SI accepted form. 

Numeric ranges are inclusive of the numbers defining the range. Unless otherwise 
indicated, nucleic acids are written left to right in 5* to 3* orientation, respectively. The 
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headings provided herein are not limitations of the various aspects or embodhnents of the . 
invention which can be had by reference to the specification as a whole. Accordingly » the 
terms defined immediately below are more fully defined by reference to the specification as 
a whole. 

S As used herein^ the term "plant" includes reference to whole plants, plant 

organs (e.g., leaves, stems, roots, etc.), seeds and plant cells and progeny of same. The 
class of plants which can be used in the methods of the invention is generally as broad as 
the class of higher plants amenable to transformation techniques, including both 
monocotyledonous and dicotyledonous plants. 

10 As used herein, "pest" includes, but is not limited to, viruses, fungi, 

nematodes, insects, and bacteria. 

As used herein, "heterologous" is a nucleic acid that originates from a 
foreign species, or, if firom the same species, is substantially modified from its original 
form. For example, a promoter operably linked to a heterologous structural gene is from a 

15 species different from that from which the structural gene was derived, or, if from the 
same species, one or both are substantially modified firom their original form. 

As used herein, "RG gene," alternatively referred to as **RLG gene," is a 
gene encoding resistance to plant pests, such as vuiises, fungi, nematodes, insects, and 
bacteria, and which hybridizes under stringent conditions and/or has at least 60% sequence 

20 identity at the deduced amino acid level to the exemplified sequences provided herein. RG 
genes encode "RG polypeptides/ alternatively referred to as "RLG polypeptides." which 
can comprise LRR motifs and/or NBS motifs. The RG polypeptides encoded by RG genes 
have at least 55% or 60% sequence identity, typically at least 65% sequence identity, 
preferably at least 70% sequence identity, often at least 75% sequence identity, more 

25 preferably at least 80% sequence identity, and most preferably at least 90% sequence 

identity at the deduced amino acid level relative to the exemplary RG sequences provided 
herein. The term "RG family" or "RG family genus" or "genus" includes reference to a 
group of RG polypeptide sequence species that have at least 60% amino acid sequence 
identity, and, the nucleic acids encoding these polypeptides. The individual species of a 

30 genus, i.e., the members of a family, typically are genetically mapped to the same locus. 

As used herein, "RG polynucleotide" includes reference to a contiguous 
sequence from an RG gene of at least 18, 20, 25, 30, 40, or 50 nucleotides in length, up to 
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at least about 100 or at least about 200 nucleotides in length. In some embodiments, the - 
polynucleotide is preferably at least 100 nucleotides in length, more preferably at least 200 
nucleotides in length, most preferably at least 500 nucleotides in length. Thus, RG 
polynucleotide may be a RG gene or a subsequence thereof. 

As used herein, "isolated,** when referring to a molecule or composition, 
such as, for example, an RG polypeptide or nucleic acid, means that the molecule or 
composition is separated from at least one other compound, such as a protein, other nucleic 
acids (e.g., RNAs), or other contaminants with which it is associated in vivo or in its 
naturally occurring state. Thus, an RG polypeptide or nucleic acid is considered isolated 
when it has been isolated from any other component with which it is naturally associated, 
e.g.^ cell membrane, as in a cell extract. An isolated composition can, however, also be 
substantially pure. An isolated composition can be in a homogeneous state and can be in a 
dry or an aqueous solution. Purity and homogeneity can be determined, for example, 
using analytical chemistry techniques such as polyacrylamide gel electrophoresis (SDS- 
PAGE) or high performance liquid chromatography (HPLC). 

The term "nucleic acid" or "nucleic acid molecule** or "nucleic acid 
sequence" refers to a deoxyribonucleotide or ribonucleotide oligonucleotide in either 
single- or double-stranded form. The term encompasses nucleic acids, i.e. , 
oligonucleotides, containing known analogues of natural nucleotides which have similar or 
improved binding properties, for the purposes desired, as the reference nucleic acid. The 
term also includes nucleic acids which are metabolized in a manner similar to naturally 
occurring nucleotides or at rates that are improved thereover for the purposes desired. The 
term also encompasses nucleic-acid-like structures with synthetic backbones. DNA 
backbone analogues provided by the invention include phosphodiester, phosphorothioate, 
phosphorodithioate, methylphosphonate, phosphoramidate, alkyl phosphotriester, 
sulfamate, 3*-thioacetal, methylene(methylimino), 3'-N-carbamate, morpholino carbamate, 
and peptide nucleic acids (PNAs); see Oligonucleotides and Analogues, a Practical 
Approach, edited by F. Eckstein, IRL Press at Oxford University Press (1991); Antisense 
Strategies, Annals of the New York Academy of Sciences, Volume 600, Eds. Baserga and 
Denhardt (NYAS 1992); Milligan (1993) 7. Med. Chem. 36:1923-1937; Antisense 
Research and Applications (1993, CRC Press). PNAs contain non-ionic backbones, such 
as N-(2-aminoethyl) glycine units. Phosphorothioate linkages are described in WO 



wo 98/30083 PCT/US98/00615 

97/0321 1 ; WO 96/39154; Mata (1997) Toxicol Appl Pharmacol 144: 189-197. Other 
synthetic backbones encompasses by the term inchide methyl-phosphonate linkages or 
alternating methylphosphonate and phosphodiester linkages (Strauss-Soukup (1997) 
Biochemistry 36:8692-8698), and benzylphosphonate linkages (Samstag {\996) Antisense 
5 Nucleic Acid Drug Dev 6: 153-156). The term nucleic acid is used interchangeably with 
gene, cDNA, mRNA, oligonucleotide primer, probe and amplification product. Unless 
otherwise indicated, a particular nucleic acid sequence includes the conq)lementary 
sequence thereof. 

The term "exogenous nucleic acid" refers to a nucleic acid that has been 

10 isolated, synthesized, cloned, ligated, excised in conjunction with another nucleic acid, in 
a manner that is not found in nature, and/or introduced into and/or expressed in a cell or 
cellular environment other than or at levels or forms different than the cell or cellular 
environment in which said nucleic acid or protein is be found in namre. The term 
encompasses both nucleic acids originally obtained from a different organism or cell type 

15 than the cell type in which it is expressed, and also nucleic acids that are obtained from the 
same cell line as the cell line in which it is expressed, invention. 

The term "recombinant," when used with reference to a cell, or to the 
nucleic acid, protein or vector refers to a material, or a material corresponding to the 
natural or native form of the material, that has been modified by the introduction of a new 

20 moieiy or alteration of an existing moiety, or is identical thereto but produced or derived 
from synthetic materials. For example, recombinant cells express genes that are not found 
within the native (non-recombinant) form of the cell or express native genes that are 
otherwise expressed at a different level, typically, under-expressed or not expressed at all. 
The term "recombinant means** encompasses all means of expressing, i.e., transcription 

25 or translation of, an isolated and/or cloned nucleic acid in vitro or in vivo. For example, 
the term "recombinant means** encompasses techniques where a recombinant nucleic acid, 
such as a cDNA encoding a protein, is inserted into an expression vector, the vector is 
introduced into a cell and the cell expresses the protein. "Recombinant means'* also 
encompass the ligation of nucleic acids having coding or promoter sequences from 

30 different sources into one vector for expression of a fusion protein, constitutive expression 
of a protein, or inducible expression of a protein, such as the plant disease resistant, or 
RG. polypeptides of the invention. 
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The term "specifically hybridizes" refers to a nucleic acid that hybridizes, . 
duplexes or binds to a particular target DNA or RNA sequence. The target sequences can 
be present in a preparation of total cellular DNA or RNA. Proper annealing conditions 
depend, for example, upon a nucleic acid's, such as a probe's length, base composition, 
5 and the number of mismatches and their position on the probe, and can be readily 

determined empirically providing the appropriate reagents are available. For discussions 
of nucleic acid probe design and annealing conditions, see, e.g., Sambrook and Ausubel. 

The terms "stringent hybridization," "stringent conditions," or "specific 
hybridization conditions" refers to conditions under which an oligonucleotide (when used, 

10 for example, as a probe or primer) will hybridize to its target subsequence, such as an RG 
nucleic acid in an expression vector of the invention but not to a non-RG sequence. 
Stringent conditions are sequence-dependent. Thus, in one set of stringent conditions an 
oligonucleotide probe will hybridize to only one specie of the genus of RG nucleic acids of 
the invention. In another set of stringent conditions (less stringent) an oligonucleotide 

15 probe will hybridize to all species of the invention's genus but not to non-RG nucleic 
acids. Longer sequences hybridize specifically at higher temperatures. Stringent 
conditions are selected to be about 5^C lower than the thermal melting point (TJ for the 
specific sequence at a defined ionic strength and pH. The T„ is the temperature (under 
defined ionic strength, pH, and nucleic acid concentration) at which 50% of the probes 

20 complementar>' to the target sequence hybridize to the target sequence at equilibrium (if 
the target sequences are present in excess, at T„, 50% of the probes are occupied at 
equilibrium). Typically, stringent conditions will be those in which the salt concentration 
is less than about 1.0 M sodium ion, i.e., about 0.01 to 1.0 M sodium ion concentration 
(or other salts) at pH 7.0 to 8.3 and the temperature is at least about 3(Pc for short probes 

25 (e.g., 10 to 50 nucleotides) and at least about 60^C for long probes (e.g., greater than 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. Often, high stringency wash conditions preceded by low 
stringency wash conditions to remove background probe signal. An example of medium 
stringency wash conditions for a duplex of, e.g., more than 100 nucleotides, is Ix SSC at 

30 45^C for 15 minutes (see Sambrook for a description of SSC buffer). An example low 

stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-'6x SSC at 4(Pc for 
15 minutes, a signal to noise ratio of 2x (or higher) than that observed for an unrelated 
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probe in the particular hybridization assay indicates detection of a "specific hybridization."! 

Nucleic acids which do not hybridize to each other under stringent conditions can still be 

substantially identical if the polypeptides which they encode are substantially identical. 

This can occurs, e.g, , when a nucleic acid is created that encodes for conservative 
5 substitutions. Stringent hybridization and stringent hybridization wash conditions are 

different under different environmental parameters, such as for Southern and Northern 

hybridizations. An extensive guide to the hybridization of nucleic acids is found in, e.g., 

Sambrook, Tijssen (1993) siq?ra. 

As used herein "operably linked" includes reference to a functional linkage 
10 between a promoter and a second sequence, wherein the promoter sequence initiates and 

mediates transcription of the DNA sequence corresponding to the second sequence. 

Generally, operably linked means that the nucleic acid sequences being linked are 

contiguous and, where necessary to join two protein coding regions, contiguous and in the 

same reading frame. 

15 In the expression of transgenes one of skill will recognize that the inserted 

polynucleotide sequence need not be identical and may be "substantially identical" to a 
sequence of the gene from which it was derived. As explained herein, these variants are 
specifically covered by this term. 

In the case where the mserted polynucleotide sequence is transcribed and 

20 translated to produce a functional RG polypeptide, one of skill will recognize that because 
of codon degeneracy, a number of polynucleotide sequences will encode the same 
polypeptide. These variants are specifically covered by the term "RG polynucleotide 
sequence" . In addition, the term specifically includes those fiiU length sequences 
substantially identical (determined as described herein) with an RG gene sequence which 

25 encode proteins that retain the function of the RG protein. Thus, in the case of RG genes 
disclosed here, the term includes variant polynucleotide sequences which have substantial 
identity with the sequences disclosed here and which encode proteins capable of conferring 
resistance to nematodes, bacteria, viruses, fungi, insects or other pests on a transgenic 
plant comprising the sequence. 

30 Two polynucleotides or polypeptides are said to be "identical" if the 

sequence of nucleotides or amino acid residues, respectively, in the two sequences is the 
same when aligned for maximum correspondence, as described below. The term 
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"complementary to" is used herein to mean that the complementary sequence is identical to- 
all or a specified contiguous portion of a reference polynucleotide sequence. 

The terms "sequence identity," "sequence similarity" and "homology" refer 
to when two sequences, such as the nucleic acid and amino acid sequences or the 
5 polypeptides of the invention, when optimally aligned, as with, for example, the programs 
PILEUP, BLAST, GAP, FASTA or BESTFTT (see discussion, supra). "Percentage amino 
acid/nucleic apid sequence identity" refers to a comparison of the sequences of two 
polypeptides/nucleic acids which, when optimally aligned, have approxunately the 
designated percentage of the same amino acids/nucleic acids, respectively. For example, 

10 . "605? sequence identity" and "60% homology" refer to a comparison of the sequences of 
two RG nucleic acids or polypeptides which, when optimally aligned, have 60% identity. 
For example, in one embodiment, nucleic acids encoding RG polypeptides of the invention 
comprise a sequence with at least 50% nucleic acid sequence identity to SEQ ID NO:L In 
other embodiments, the RG polypeptides of the invention are encoded by nucleic acids 

15 comprising a sequence with at least 50% sequence identity to SEQ ID N0:1, or, are 
encoded by nucleic acids comprising SEQ ID N0:1, or, have at least 60% amino acid 
sequence identity to the polypeptide of SEQ ID N0:2. 

"Percentage of sequence identity" is determined by comparing two optimally 
aligned sequences over a comparison window, wherein the portion of the polynucleotide 

20 sequence in the comparison window may comprise additions or deletions (i.e., gaps) as 
compared to the reference sequence (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. The percentage is calculated by determming the 
number of positions at which the identical nucleic acid base or amino acid residue occurs 
in both sequences to yield the number of matched positions, dividing the number of 

25 matched positions by the total number of positions in the window of comparison and 
multiplying the result by 100 to yield the percentage of sequence identity. 

The term "substantial identity" of polynucleotide sequences means that a 
polynucleotide comprises a sequence that has at least 55% or 60% sequence identity, 
generally at least 65%, preferably at least 70%, often at least 75%, more preferably at 

30 least 80% and most preferably at least 90%, compared to a reference sequence using the 
programs described above (preferably BESTFTT) using standard parameters. One of skill 
will recognize that these values can be appropriately adjusted to determine corresponding 
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identity of proteins encoded by two nucleotide sequences by taking into account codon 
degeneracy, amino acid similarity, reading firame positioning and the like. Substantial 
identity of amino acid sequences for these purposes nonnally means sequence identity of at 
least 55% or 60%, preferably at least 70%, more preferably at least 80%, and most 

5 preferably at least 95% . Polypeptides having "sequence similarity" share sequences as 
noted above except that residue positions which are not identical may differ by 
conservative amino acid changes. Conservative amino acid substitutions refer to the 
interchangeability of residues having similar side chains. For example, a group of amino 
acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a 

10 group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group 
of amino acids having amide-containing side chains is asparagine and glutamine; a group 
of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a 
group of amino acids having basic side chains is lysine, arginine, and histidine; and a 
group of amino acids having sulfur-containing side chains is cysteine and methionine. 

15 Preferred conservative amino acids substimtion groups are: valine-leucine-isoleucine, 
phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine. 

Another indication that nucleotide sequences are substantially identical is if 
two molecules hybridize to each other under appropriate conditions. Appropriate 
conditions can be high or low stringency and will be different in different cuxumstances. 

20 Generally, stringent conditions are selected to be about 5^C to about 20^C lower than the 
thermal meltmg point (Tm) for the specific sequence at a defmed ionic strength and pH. 
The Tm is the temperature (under defmed ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. Typically, stringent wash 
conditions are those in which the salt concentration is about 0.02 molar at pH 7 and the 

25 temperamre is at least about 5(f C. However, nucleic acids which do not hybridize to each 
other under stringent conditions are still substantially identical if the polypeptides which 
they encode are substantially identical. This may occur, e,g,, when a copy of a nucleic 
acid is created using the maximum codon degeneracy permitted by the genetic code. 

Nucleic acids of the invention can be identified from a cDNA or genomic 

30 library prepared according to standard procedures and the nucleic acids disclosed here used 
as a probe. Thus, for example, stringent hybridization conditions will typically include at 
least one low stringency wash using 0.3 molar salt (e.g., 2X SSC) at 6fC, The washes 
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are preferably followed by one or more subsequent washes using 0.03 molar salt (e.g., 
0.2X SSC) at 50^C, usually 60°C, or mosre usually 65*^0. Nucleic acid probes used to 
identify the nucleic acids are preferably at least 100 nucleotides in length. 

As used herein, "nucleotide binding site" or "nucleotide binding domain" 
5 ("NBS") includes reference to highly conserved nucleotide-. Le., ATP/GTP-, bmding 
domains, typically included in the "kinase domain'* of kinase polypeptides, such as a 
kinase-la, kinase 2, or a kmase 3a motif, as described herein. For example, the tobacco N 
and Arabidopsis RPS2 genes, among several recently cloned disease-resistance genes, 
share highly conserved NBS sequence. Kinase NBS subdomains further consist of three 

10 subdomain motifs: the P-loop, kmase-2, and kmase-3a subdomains (Yu (1996) Proc. Acad, 
ScL USA 93:11751-11756). As discussed in detail herein, examples include the 
Arabidopsis RPP5 gene (Parker (1997) supra), the A. thaliana RPS2 gene (Mindrinos 
(1997) supra), and the Oax L6 rust resistance gene (Lawrence (1995) supra) which all 
encode proteins containing an NBS; and Mindrinos (1994) Cell 78:1089-1099; and Shen 

15 (1993) FEES 335:380-385. Using the teachings disclosed and incorporated herein and 
standard nucleic acid hybridization and/or amplification techniques, one of skill can 
identify members having NBS domains, including any of the genus of NBS-containing 
plam disease resistant polypeptides of the invention. 

As used herein, "leucine rich region" ("LRR") includes reference to a 

20 region that has a leucine content of at least 20% leucine or isoleucine, or 30% of the 

aliphatic residues: leucine, isoleucine, methionine, valine, and phenylalanine, and arranged 
with approximate repeated periodicity. The length of the repeat may vary in length but is 
generally about 20 to 30 amino acids. An LRR-containing polypeptide typicially will have 
the canonical 24 amino acid leucine-rich rq)eat (LRR) sequence , which is present m 

25 different proteins that mediates molecular recognition and/or mteraction processes; as 
described in Bent (1994) Science 265:1856-1860; Parker (1997) Plant Cell 9:879-894; 
Hong (1997) Plant PhysioL 113:1203-1212; Schmitz (1997) Nucleic Acids Res. 
25:756-763; Hipskind (1996) MoL Plant Microbe Interaa. 9:819-825; Tomero (1996) 
Plam /. 10:315-330; Dixon (1996) Cell 84:451-459; Jones (1994) Science 266:789-793; 

30 Lawrence (1995) Plant Cell 7:1195-1206; Song (1995) Science 270:1804-1806; as 

discussed in further detail supra. Using the teachings disclosed and incorporated herein 
and standard nucleic acid hybridization and/or amplification techniques, one of skill can 
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identify polypeptides having LRR domains, including any member of the genus of LRR- . 

containing RG polypeptides of the invention. 

The term "promoter" refers to a region or sequence determinants located 

upstream or downstream from the start of transcription and which are involved in 
5 recognition and binding of RNA polymerase and other proteins to initiate transcription. A 

"plant promoter" is a promoter capable of initiating and/or regulating transcription in plant 

cells; see also discussion on plant promoters, supra. 

The term "constitutive promoter" refers to a promoter that initiates and 

helps control transcription in all tissues. Promoters that drive expression continuously 
10 under physiological conditions are referred to herem as "constitutive" promoters and are 

active under most environmental conditions and states of development or cell 

differentiation; see also detailed discussion, supra. 

The term "inducible promoter" refers to a promoter which directs 

transcription under the influence of changing environmental conditions or developmental 
15 conditions. Examples of environmental conditions that may effect transcription by 

inducible promoters include anaerobic conditions, elevated temperature, drought, or the 

presence of light. Such promoters are referred to herein as "inducible" promoters; see also 

detailed discussion, supra. 

The term "abscission-induced promoter" or "abcission promoter" refers to a 
20 class of promoters which are activated upon plant ripening, such as fruit ripening, and are 

especially useful incorporated in the expression systems (e.g., expression cassettes, 

vectors) of the invention. When the plant disease resistant polypeptide-encoding nucleic 

acid is under the control of an abcission promoter, rapid cell death, mduced by expression 

of the invention's polypeptide, accelerates and/or accentuates abcission of the plant part, 
25 increasing the efficiency of the harvesting of fruits or other plant parts, such as cotton, and 

the like; see also detailed discussion, supra. 

The term "tissue-specific promoter" refers to a class of transcriptional 

control elements that are only active in particular cells or tissues. Examples of plant 

promoters under developmental control include promoters that initiate transcription only 
30 (or primarily only) in certain tissues, such as roots, leaves, fruit, ovules, seeds, pollen, 

pistols, or flowers; see also detailed discussion, supra. 
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As used herein "recombinant" includes reference to a cell, or nucleic acid, . 
or vector, that has been modified by die introduction of a heterologous nucleic acid or the 
alteration of a native nucleic acid to a form not native to that cell, or diat the cell is derived 
from a cell so modified. Thus, for example, recombinant cells express genes that are not 
5 found within the native (non-recombinant) form of the cell or express native genes that are 
otherwise abnormally expressed, under expressed or not expressed at all. 

As used herein, a "recombinant expression cassette" or ''expression 
cassette** is a nucleic acid construct, generated recombinantly or synthetically, with a series 
of specified nucleic acid elements which permit transcription of a particular nucleic acid in 
10 a target cell. The expression vector can be part of a plasmid, virus, or nucleic acid 

fragment. Typically, the recombinant expression cassette portion of the expression vector 
includes a nucleic acid to be transcribed, and a promoter. 

As used herein, "transgenic plant" includes reference to a plant modified by 
introduction of a heterologous polynucleotide. Generally, the heterologous polynucleotide 
15 is an RG structural or regulatory gene or subsequences thereof. 

As used herein, "hybridization complex" includes reference to a duplex 
nucleic acid sequence formed by selective hybridization of two single-stranded nucleic 
acids with each other. 

As used herein, "amplified" includes reference to an increase in the molarity 
20 of a specified sequence. Amplification methods include the polymerase chain reaction 
(PGR), the ligase chain reaction (LCR), the transcription-based amplification system 
(TAS), the self-sustained sequence replication system (SSR). A wide variety of clonmg 
methods, host cells, and in vitro anqilification methodologies are well-known to persons of 
sldU. 

25 As used herein, "nucleic acid sample" includes reference to a spechnen 

suspected of comprismg RG resistance genes. Such specunens are generally derived, 

directly or indirectiy, from lettuce tissue. 

The term "antibody" refers to a polypeptide substantially encoded by an 

immunoglobulin gene or immunoglobulin genes, or fragments or synthetic or recombinant 
30 analogues thereof which specifically bind and recognize analytes and antigens, such as a 

genus or subgenus of polypeptides of the invention, as described supra. 
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It is understood that the examples and embodiments described herein are for . 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spuit and purview 
of this application and scope of the appended claims. 

5 

EXAMPLES 

The following examples are offered to illustrate, but not to limit the claimed 

invention. 
Example 1 

10 Example 1 describes the use of PCR to amplify RG genes from lettuce. 

Multiple primers with low degeneracy, particularly at the 3' end, were 
designed based on the sequences of two known resistance genes from tobacco and flax. 
DNA Templates 

Lettuce genomic DNA was extracted from cultivar Diana and a mutant line 
15 derived from cultivar Diana using a standard CTAB protocol. To generate cDNA 
templates, RNA was isolated from cultivar Diana and the mutant following standard 
procedures; first strand cDNA was synthesized using Superscript reverse transcriptase 
from 1 Og total RNA as specified by the manufacturer (Life Technologies). 6 AC (bacterial 
artificial chromosome) clones from the Dm3 region were isolated from a BAC library of 
20 over 53,000 clones using marker AC15 that was known to be closely linked to Dm3, 
Bacterial plasmids containing clones of L6 and RPS2 were used as positive controls. 

PCR with degenerate oligonucleotide primers 

Oligonucleotide primers were designed based on conserved motifs in the 
25 nucloetide binding sites (NBS) of L6, RPS2, and N. Eight primers were made 

corresponding to the GVGKTT motif in the sense direction; each had 64-fold degeneracy. 
Six primers were made to the GLPLAL motif in the anti-sense direction; with either 16 or 
256-fold degeneracy (Table 1). 

Oligonucleotides included 14-mer adaptors of (CUA)^ at the 5' end of the 
30 sense primers and (CAU)4 at the 5* end of the antisense primers to allow rapid cloning of 
the PCR products into pAMPl (Life Technologies). 
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PGR amplification was performed in 50 Ol reaction volume with 1 OM of . 
each of a pair of sense and antisense primers. The templates were denatured by heating to 
94EC for 2 min. This was followed by 35 cycles of 30 sec at 94EC, 1 min at 50EC, 2 mm 
at 72EC, with a single final extension of 5 min at 72EC. 25 ng of genomic DNA or cDNA 
was used. BAG clones as templates required less. The final dNTP concentration was 0.2 
mM; MgClj was 1.5 mM. 

Forty-eight combinations of sense and antisense primers were tested on a 
panel of nine templates consisting of two genomic DNA sanq)les» two cDNA preparations, 
three BAG clones and plasmids containing L6 and RPS2 as positive controls. 
Amplification from L6 and RPS2 resulted in fragments of 516 and 513 repectively. Seven 
combinations of primers resulted in fragments of approximately this size with multiple 
templates (Table 2). Primers that gave RLG products were: PLOOPAA, PLOOPAG, 
PLOOPGA, PLOOPGG, PLOOPAG, GLPL3. GLPL4. 



(Intentionally left blank) 
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DEGENERATE PRIMER SEQUENCES for NBS PCR 

Sense primers based on GVGKTT amino acid sequence from L6, N and rps2 FLOOP 
ntotif: 

PLOOPAG 5* GGNGTNGGNAAAACGAC 3* 
PLOOPAA 5' GGNGTNGGNAAAACAAC 3* 
PLOOPAT 5' GGN GTN GGN AAA ACT AC 3* 
PLOOPAC 5' GGN GTN GGN AAA ACC AC 3* 
PLOOPGG 5* GGNGTNGGNAAGACGAC 3' 
PLOOPGA 5* GGNGTNGGNAAGACAAC 3* 
PLOOPGT 5* GGN GTN GGN AAG ACT AC 3' 
PLOOPGC 5' GGN GTN GGN AAG ACC AC 3* 
Antisense primers based on GLPLAL amino acid sequence: 

GLPLl 5' AGN GCN AGN GGN AGG CC J* 

GLPL2 5' AGN GCN AGN GGN AGA CC 3* r 

GLPL3 5' AGN GCN AGN GGN AGT CC 3* ' 

GLPL4 5- AGN GCN AGN GGN AGC CC 3' 

GLPL5 f AANGCC AANGGC AAACC 3* 

GLPL6 5* AAN GCC AAN GGC AAT CC 3* 
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TABLE 2. Characteristics of RLGs isolated from lettuce. 





Template 


Primers 


Number' 


Size^ 
(bp) 


Copy 
niimber'' 


X>m 
linkage 


RIiGl 


genomic DNA 
cDNA 

genomic DNA 
cDNA 


PL00P6A-i>GZiPL6 
PL00PGA+GLPL6 
PL00PAA<I-6LPL6 
PL00PAA+GLPL6 


B/6 
1/5 
5/5 
1/1 


522 




DM4, 
DJttlJ 


RL62 


BACH8 


PL00PGG-I-GLPL3 


3/3 


510 




DM1, 
Dm3 


RL63 


gemonic DNA 


PL00PGA+6LPL4 


3/6 


461 




DmS 
DmB 


RL64 


genomic DNA 


PL00PGA-fGLPL4 


1/6 


524 







' Niunber of RLG sequences out of total number of clones sequenced. 

Size of fragment amplified from the nucleotide bindind domain. 
« Estimated copy number from genomic Southern blot analysis and nuntbers of 
15 clones in the BAC library. 



Exanpte 2 

Example 2 describes the genetic analysis used to obtain a preliminary 
indication of the linkage relationships of the amplified products and known clusters of 

20 resistance genes. 

Bulked segregant analysis was performed to obtain a preliminary indication 
of the linkage relationships of the amplified products and known clusters of resistance 
genes, DNA from individuals were pooled for each susceptible and resistant bulk. 
Amplified products were then mapped by RFLP analysis from our intrasjpecific mapping 

25 population. Resistances from four clusters of resistance genes as well as over six hundred 
markers have now been mapped on this population. Linkage analysis was done using 
JIONMAP or MAPMAKER mapping programs. Due to a suppression of recombination in 
the Dm3 region, sequences were mapped relative to DmS using a panel of deletion mutants 
that provided greater genetic resolution than the mapping population (Anderson et al 

30 1996>. All blots were washed twice at 63EC in 2x SSC/1 % SDS for 20 min, followed by 
one wash at 63EC in Ix SSC/0.1% SDS for 10 or 30 min. 
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Most of the RLG sequences were analyzed by bulked segregant analysis 
(BSA) using pools of resistant and susceptible individuals for each of the four clusters of 
resistance genes. In genomic Southern analyses, all the RLGs revealed numerous 
fragments of varying intensity. The numbers of bands was highly dependent of the 
5 stringency of hybridization. BSA demonstrated that RLGl was linked to the Dm4, 7 and 
Dml3 clusters. Segregation analysis confirmed this linkage. 

RLG2 was derived from B AC H8 that was known to be from the Dm3 
region. BSA with RLG2 demonstrated that the polymorphic bands that distinguished the 
parents of our mapping population mapped to the DmhDm3 cluster. Several bands 
10 absolutely cosegregated with Dml or Dm3. To provide finer genetic resolution, RLG2 
was also mapped using a panel of Dm3 deletion mutants. A number of fragments were 
missing in largest deletion mutant demonstrating that several RLG2 family members are 
physically located very close to Dm3, No fragment was missing in all deletion mutants; 
however, this is not unexpected as there is extensive duplication within the region. 

15 

Example 3 

Example 3 describes the screening of a bacterial artificial chromosome 

library. 

Over 53,000 BAG clones containing lettuce genomic DNA were screened 
20 with two of the amplified products. High density filters each containing 1536 clones were 
hybridized to ^-P labelled probes. Filters were washed at 65EC with 40 mM N%PO4/0. 1 % 
SDS for 5 min followed by 20 min in the same solution. 

To isolate additional RLG sequences we screened our genomic BAG library. 
Clones were identified that hybridized to RLGl and RLG2. Nearly all the clones that 
25 hybridized to RLG2 also hybridized to marker AC15 that had already been shown by 

deletion mutant analysis to be clustered around DmJ. This provided further evidence for 
clustering of RLG2 sequences. 

Using primers conserved within each family, part of the NBS was amplified 
from each unique BAG clone and sequenced. This revealed that members within each 
30 family varied from 64% identical at the deduced amino acid level. The most divergent 
members only weakly cross-hybridized to each other. Currently, RLG sequences are 
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considered to be part of the same family of sequences if they are at least 55% identical at . 
the deduced amino acid level and map to the same region of the chromosome. 



Exampte 4; 

5 Example 4 describes the cloning, identification, sequencing and 

characterization of RG polynucleotide sequences; including use of RG sequences from 
plasmid and PGR products. 

Doubled stranded plasmid DNA clones and PGR products were sequenced 
using an ABI377 automated sequencer and fluorescently labelled di-deoxy terminators. 

10 Sequences were assembled using Sequencher (Genecodes), DNAStar (DNAStar) and 
Generics Computer Group (GCG, Madison, WI) software. Database searches were 
performed using BLASTX and FASTA (GCG) algorithms. 

Sequences flanking the NBS region for RLG2 and for some of RLGl were 
obtained by a series of IPCR and the products sequenced directly. IPCR worked less well 

15 for RLGl . Therefore RLGl was subcloned from a BAG clone into pBSK (Stratagene) and 
the double stranded plasmid sequenced by long range sequencing. 

Initially, a total of 30 clones were sequenced. Three of these seven primer 
combinations yielded sequences that comprised continuous open reading frames with 
sequence identity to the NBS of known resistance genes. Seven out of 10 clones amplified 

20 from genomic DNA with the primer pair PL00PGA/GLP6 were 522 bp long; they were 
idenucal to each other and named RLGl. All six clones amplified from genomic DNA or 
cDNA using the primers PL00PAA/GLP6 were sunilar/the same as RLGl. All three 
clones sequenced from BAG clone H8 were 510 bp long, identical to each other but 
different from RLGl and were therefore designated RLG2. The 11 clones sequenced from 

25 four other primer combinations had no similarity to any NBS motifs and therefore were not 
studied further. Therefore, sequencing resulted in the identification of clones containing 
NBS motifs representing four RLG sequences. 

Comparison of the deduced amino acid sequences of RLGl and RLG2 to 
those of known resistance genes revealed that RLGl and RLG2 are as similar to each other 

30 as they are to resistance genes from other species and that this is the same level of identity 
shown between the known resistance genes (Table 3). The percent identity (upper 
quadrant) and percent identity (lower quadrant) were determined using the MEGALIGN 
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routine of the DNASTAR package. Identity refers to the proportion of identical amino 
acids; identity refers to the proportion of identical and similar amino acids and takes into 
account substitutions of amino acids with similar chemical characteristics. RGl and RG2 
are as similar to each other and to cloned resistance genes as cloned resistance genes from 

5 a variety of species are to each other. L6, resistance to MeUmtpsora lini in flax (Lawrence 
et aL , 1995). iV, resistance to tobacco mosaic virus in tobacco (Whitham et aL , 1994). 
PRF, required for resistance to Pseudomonas syringae in tomato. RPS2, resistance to 
Pseudomonas syringae in Arabidopsis thaliana (Bent et al. , 1994; Mindrinos et aL , 1994). 
RPMl , resistance to Pseudomonas syringae pv. maculicola in A. thaliana (Grant et al , 

10 1995). The initial RGl and RG2, sequences were amplified from lettuce using degenerate 
primers. 



15 



20 



Lettuce 

Lettuce 

Lettuce 

Lettuce 

Tobacco 

Arabidopsis 



Table 3 
roENTITIES OF 
RESISTANCE GENE HOMOLOGUES 





RGl 


RG2 


RG3 


RG4 


N gene 


RPS2 


RG1 


• • • 


22.7 


15.0 


29.2 


25.4 


23.8 


RG2 




• «* 


32.2 


21.6 


22.7 


33.0 


RG3 






« • « 


17.2 


15.0 


32.8 


RG4 








#«« 


44.3 


22.7 


N gene 










* •* 


21.6 


RPS2 












« • • 



25 

The regions homologous to the primers are included in this analysis as the 
genomic sequences for RLGl and RLG2 were determined by BPCR. Interestingly, the 
genomic sequences for RLGl exactly matched that of the primers used. 

To obtain further evidence that we had amplified resistance genes, we 
30 amplified the regions flanking the NBSs of RLGl a and RLG2a by IPCR of BAC clones. 
These products were then directly sequenced without cloning to minimize the introduction 
of PGR artifacts. Sequence analysis of the 5' regions failed to detect any homology to 
known resistance genes. However, the sequence of the 3' region contained leucine-rich 
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repeats (LRRs). When this sequence was used to search GENBANK using BLASTX, it - 
detected identity to the Arabidopsis resistance gene, RPS2. This region does not contain as 
regular LRRs as in some resistance genes; however, the repeat strucdire seems to be 
consistent with that of the flax resistance gene, L6. Therefore, the presence of an LRR 
5 region is further evidence that the sequences we amplified using degenerate oligonucleotide 
primers are probably resistance genes. 

The sequences of the IPCR products also provided the genomic sequences of 
the regions complementary to the sequences of the degenerate oligonucleotide primers. 
The genomic sequences for RLGl were identical to one of the primers m the mixture. 
10 The RLG sequences are resistance genes as supported by three criteria: the presence of 
multiple sequence motifs characteristic of resistance genes, genetic cosegregation with 
known resistance genes, and their existence as clustered multi-gene families. The presence 
of LRR regions in a similar position relative to the NBS as in cloned resistance genes 
provides stronger evidence than relying solely sequence similarity between NBS regions. 
15 The clustering of RLG sequences at the same position as the known clusters of resistance 
genes make them strong candidates for encoding resistance genes. The hybridization 
patterns and genetic distribution of die RLG sequences are similar to that of cloned 
resistance genes in other species. Most of these hybridize to small multigene families and 
preliminary genetic evidence indicates that they are chistered m the genome. Therefore, 
20 the degenerate primers tiiat we designed from other resistance genes seemed to have been 
specific enough to amplify resistance genes rather dian P-loop contaming proteins in 
general. 



25 



(intentionally left blank) 



30 
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1 ATCGTAAC0C7ITCGTACGAG AKC GC TGrC CClt^ ' C n CA TC U'lTItJ'l^JATAT GTCATAT ^C TCAm^ TItrTGCC ACAIOT 
81 AA aTl ' :UXU;rr A Trri ' A AA TTAA rm ' rA T TCCACATCST catxttatgagtititctat tttattgagtttcacataat 
16X ATTTAAATGTAATAACAATA AATCXZATATXTATTrrTOT TAAATAAA O GCATRTAA TAT ATA GATTAAAA'^ CA TATRA T 
241 ACATAQGTrAAACTCATATA ATACAIATGTTCA TOOOC AG TTTATn RT yXUIC TCATCC TTAATTI RTriA 'r m'lTO r 
321 TTATTAGAGTAGATGATCTT TGTGATATTAAAAATnAAT TTtfnUAARATlTOAA ATTA TTAAT AATOCXaCAA'XTrUA 
401 ATAAAATTAAAAAAAATGQI CCCftCCATrRGT CCATCA Cr 'X TX'XUtfSCICA'ICAAXR.TCG TGASr ATTC10L TXlX»'XTlt: 
4B1 CAOOCTAATCAATATTICCA GOGAATGA CAGRCXTX'KROG GafXTXVXUftA YX''XWG TTC OGACACT CTTCKn GAAOGA 
561 GATAATAAATCAAATGGAGC TGCTOCAATGTXCATTGCTG ATGAAAGGTG^TTGTRTGT GAAGANAATGTCAGCGATOI 
641 ATCTCCATCCGGAACCCACC ACaOTATCMTCTACCACCA AACCACICAA AACGG YGGAA GTAGRRAKA aVRKAAAglCA 
721 TGAkGAATAGATTATTTTTG TCCTCATGGGCTGACTGAGG AGCGGGmAGTTCATCATT •I'X'XVXTXUAMCAAAGAATTA 
801 ' X XS G GTC CA TCGAA ITITXA C ATCGACAAAGAAGTITCACT TCX3CAA'Xl»XTi"il»rXAAACA ATTTTTAATCTTTTTATCTT 
BBl rXtXrX ' XU AAACTCCT CA ATT GCAACrrGCAACITGCAACT TTTGGGCCCACAAATTTCIG GTGGGCGTTAATTTAATCCA 
961 CATATTCACTGTAAACAATA ATTGAAATCGATCTCTGTTC ATCCAATTCATCAACATCTC TTGATAATTGAAATCATTCA 
1041 CGCTTCr^TCXATITCATCCA CATCTAmrTATATTCTCTG CTOTATCATATOAAAOGAT GGCTGAAATU;ril.Ti'Xt:X>G 
1121. C L ' l ' XVl ' lXj ACA tjlljCj ' llJi ' l ' r GAAAAGCTGGCATVTGAAGC CTTSAAGAAGATrGTTCGCT CCAAAAGAATTGAAICIGAG 
1201 CTTAAGAAATTGAAGGAGAC ATIAGACCAAATCCAAGATC TGCrTAACGAIGCITCC^G AAGG AAGTA ACIAATGAAGC 
1281 CGTTAAAAGATGGCTGAATG ATCTCCAACATrTGGCnAT GACATAGAOGAOCTACXTGA TGATOTTSCAACTGAAGCXG 
U61 TTCAiXGrTGACsITGACCGAG GAGGGTGGAGCCTCCICCAG TATGSTAAGRAAACTAA JCC CAAGTIGTTGCACAAGITTC 
1441 TCACAAAGTAATAGGATGCA TGCCAAGITAGATGATATIG CCftCCAUUTTACAAGAACIG GTAGAGGCAAAAAATAAICT 
1S21 TGGI7TAAQTGTGATAACAT ATGAAAAGCCAAAAATTGAA AOGIATGAGGCGICZTIGGT AGATGAAAGOGGTACTSICG 
1601 GA OriUA AGAIGATAAG&AA AAATTGCTGGAGAAGCTGTT GGGGGATAAAGATGAATCAG GGAGTCAAAACXTCAGCATC 
16B1 GT GC CCATAU'X'XGG'rATG GG TGGAGTrOGTAAAACAACTC TAGCTAGACXTX'XUTATGAT GAAAAGAAAGTGAAGGATCA 
1761 CTlCGAACTCAGGGCrroGG ' ri ' XU ' XlJXTXl. ' X U ATGASTT C ACTGTTCCCAATATAAGCAG AGTIATITATCAATCTCIG^ 
1841 C T G G GGAAAAGRAGGAGTTT GAAGAdTAAATCIGCTICA AGAAGCICTEAAAGAGAAAC TTAGGAACCAGCTAITTCTA 
1921 ATACmTCGATGATGTGTG GTCTGAAAGCTATGCTGATT GGGASAAATTAGTGGGCCCA TICCTTGCGGGGTCTOCTOG 
2001 AAGTAGAATAATCATGACAA CTCGGAAGGAGCAATTGCTC AGAAAGCTGGGCXTX 'XXJTXCA TCAAGACCCTCTGGAGGGTC 
2081 TATCACAAGATGATGCTTTG TCTTIGTlTaCICAACACQC ATrKXyiGTACCAAACTTTG ATTCACATCCAACACTAAOS 
2161 CCACATGGAGAACTGTTTGT GAAGAAATGTGATGGCTTAC CTCTAGCriTAAGAACACTr GGAAGCjX'X'A'X'X'AAGGACAAA 
2241 AACAGACGAGGAACAATGGA AGGAGCIglTGGATAG TGAG ATATGGAGGT IR GGAAAGAG CGATGAGAT XIJX'XX.'mUCIC 
2321 TTAGACTAAGCTACAATGAT LTX'XUXUCCC'X."X*X'XtaAAGCT RTZltnTOCATAYTGCZCCr TGmCCCAAQGACTATGAG 
2401 TTTGACAAGGAGGAGTZTGAT TCXATTGTGGATGGCAGAAG UJXTX'X'XUCRCCA A^AACT AVAAACAAGTCAAAGCAA06 
2481 KTTGGSTCrrGAATATZTIR AAGnU'X'Xl^fX'XU'XtJAAGftXCR TrTTTTCAACATGCTOCTAA TKRCAAATCSTICTTTSTGA 
2561 TGCATGACCXAATGAATGAT TIt3GCTAC ArX TC rXCC T GG AGA A ' X TXTXT ICAAGGTTAG ACATAGAGATGAA^AGGAA 
2641 TTTAGGATCSAATCTTTCXaA RAAGCACCQ-XATATGTCAT TrGTATGTGAGRATTACATA GGTTACAAAARGTrCGAGOC 
2721 ATTTAGAGGAGCTAAAAATT TGAGAACA1*X'XTX!AGCATTG TCTGTTGGGGIGGTAGAAGA TIGGAAGATGnTTACrtAT 
2801 CAAACAAGGTCTTGAATGAC V^r^ACT^CARGATT^ACCAT^ GrnUUSGGrrCCTRAKTITGA TTRRICITAYAATAASYRAG 
2881 GTACCARAAICrcGTSGGTAG TATGAASCACTTGCGGTATC TTAATCTATCtXSRAACTT/JA ATCACMCATTTACCGGAAWA 
2961 TKTCTGCAATCTTTATAATT TACARACCCTGATTGTKICT GGCTGTGAMrATTTAGTTAA KITGCCCAARACCnCTCAA 
3041 ASCrrAAAAATTIGCASCAT TTTGACATGAGGGRTACTCC KAMOTrRAARAACATGCCCT TARGGATTGGTGARTTGAAA 
3121 ARTCTACAAA CXVXV rri^ g TAACATTGGCATAGCAATAA C CGAGC TTAAGR ACTTGCAM AAYCTC CATCG GAAAR TITG 
3201 TA'X'mjCGGGCTGGGAAAAA TGGJUiAATGCMGTKGGATGC AGCaTAAGOGAACTTGTCXC AAAAAAOGTTtZAAlGARITA 
3281 NAAACIG^TI^GGOGGXGA Tl^TTTAATGTTETCCGAA ATGG GAACft CITG AAAAAG A A a TCCTC A ATGAAG TGATGC 
3361 CICATAATGGTACXCTAIAA AAAACCCAIAATTATGTCTA TAGGGGGTATAGAtJXTXtXIA AAnGGGTTGGTIT^CACIAA 
3441 GUiXTXVXUAAACTAGAGAT GTGITCA3GGTGTA1GAAAA AGMTICmTAOGTAGmC ATCAATCACCAAGTGGGAAA 
3521 TAGATGATATTTTCAGGGC/ TACTGATGAGATGTGGAGAG GTAlGATAGGGTTfrCTTGGG GCGGTAGAAGAAATAAGCAT 
3601 CC A ' X ' lCritJrA ATGAAATAA GATATVTGTGGGAATCAGAA GCAGAGGCAAGTAAGGTTCT TATGAATTTAAAGAAGTTGG 
3681 ATTTAGGTGAATGTGAAAAT TTGGTGAGTZTACGGGAGAA AAAGGAGGATAATCATAATA TTAATAGTGGGAGCAGCCTA 
3761 AGATCmTAGGAGGTPGAA TGTATGGAGATGrrAACAGCT TGGAGCATTGCAGGrrGTCCA GATAGCATOGAGAATTICTA 
3841 TATGCACATGTGTGATTCAA Tt^ACATCXXTTCTCCTraXA ACAGGAGGAGGACAGAAGAT CAAGTCACTtACCATCACTG 
3921 ATrGCAAGAAGCTTTCGGAA GAGGAGTTGGGAGGACGAGA GAGGACAAGAGTGCTrATAA ACTCAAAAATGCAGATGCTT 
4001 GAATCAGTAGATATACGTAA TTGGCCAAATCTGAAATCTA TCZAGrGAATTGAGTrGCTZC ATXCACCTGAACAGATTATA 
4081 TATATCAAACTOICCGAGTR TGGAGlt:ATTTCCTGACCAT GAGTTGCCAAATCTCAOCTC CTTAACAGATCGAAGGAGAG 
4161 aACAGCGArrrrCGTACXaA CXXrrrACGATrCGACXGGCC CXCGTX ' XT 
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1 AAOCCnOST ACGAGAMOG CXtJl U lTCTC CTTCCTCrAA TATAATGATA AGAAAAAATA TGATTAAAQG 
71 TmAATOCA AAATCCATPA TTCCACCQGT GATATGATGC ACTAGCIGTA GT ATGCAAAA ACAGTATTAT 
141 AAATCCnUVC CAAAACAGCA GCTAAGAAAC AATATAAATA ATGGTITCAA 'lW:U,' mv: TCCCTACAgT 
211 CATTTCTICC AAATCCCTAT CATTCATACA TACAAGTGCT CCCATATTAG CTTTTCACTA 'TAAGCA ATGG 
281 CIX3AAATCCT IGGTTCTGCG TTCTITGCGG TGTTCTTIGA AAAGCTTCCT TCTGAAGCCT TGAAG AGGGT 
351 TCCTTCCrrCC AAAGTAATTG ACAAGGAGCT CGAGAAATTG AATAGCTCAT GAAT CAATAT AAAAGCICTG 
421 CTCAATGATG CTTCTCAGAA GGAAATAAGT AAGGAAGCTG TTAAAGAATG GTTGAATGCT CTICAA CATT 
491 TCCCTTACGA CATAGATGAT CTACTTQGCG ATTTGGCAAC CAAAGCTA1C CAl XXrrAAGT TCTCTGAGGA 
561 ATACGG3GCC ACCATCAACA AGGTACGAAA GTTAATTCCA TCTTGTTXCT CTAffmCTC A AGTACTAA G 
631 ATGCGCAACA AGATACATAA TATTACCAGC AAGTTACAAG AACTATZ3USA AGftfiftGAAA T AATCITQGAT 
701 TATOIGAAAT TGGTGAAAGC CGAAAACTTC GAAATAGAAA ATCAGAGACC TCTNTG^G ATCCRT CrAO 
771 TATOTTOGA CX3CACAGATG ATAAGGAAGC UTlUri'lCTC AAG CTATA TC AAOC ATOIGA TAGAAACXTT 
B41 AGCATCTTCC CNATACSTIGG TATGGGTOGG TTAGATAAGA C CACm 'AGG TAGACTTTTG TOIGAT NAAA 
911 TGCAAGTOAA GGATCACTTC GAACTCAAGG CXSTGGGTITG TGTTnCIGAT GAGTITCATA TCTTOOG^ 
981 AAGCAAAACC ATrTTCGAAT CGATAGAGGG GGGAAACCAA GAGTTTAAGG ATTTAAATCT GCTTCAGGTG 
lOSl GCnTAAAGG AGAAAATCTC AAAGAAACGA rnvriUl ' IU TTCTTGATGA TGTATGGAGC GAGAGCTATA 
1121 CTGATIQGGA AATTCTAGAA CGTCCATTIC TAGCAGGAGC ACCAGGAAGT AAA CTAATCA TCACAACCCG 
1191 CAAGTrtSTCG TTGCTAAACC AATICGGTCA TGATCAACCA TACCAATTGT GTG^TTGTC ACATGACAAT 
1261 GCTCTATCCT TATTTrGTCA ACACGCATTT GGTGTAAATA GC TTTGA TTC ACA TXXGA TA CTTAAAOCAC 
1331 ATCGTCAAGG TATTOnTGAA AAATGTGATG GTTTCCCATT GGCITTGATr GCACTTGGGA GGTTATTGAG 
1401 GACAAAAAGA GATGAGGAAG AATGGAAGGA ACTATTGAAT AGTOAG ATAT GGAGGT TAGG AAAGAG AGAT 
1471 GAGATTATTC CGGVTCTTAG ACEAAGCTAT AATGATCTTT ClUC L'lUl'lT GAAGCAGTTG ITIU MATT 
1541 GCTCCTTGTT CCCCAAAGAC TATOlGnCA ACAAGGAGAA G TTGATl'i 'm TTATGG ATGG CAGAAGGGT T 
1611 TTIGCACAAT GAAAATACAA ACAAGTCAAT GGAACGCTIA OTTCTTGAAT ATITIUAOGA Criia TOrCA 
1681 AGGTCATTTT 1TCAACATGC ACTOGATGAC AAATOC5TTGT TTQTGGTGCA CGACCT CATC AATGACTTQG 
1751 CCACATOTCT TGCIGGAGAT TATTTTTEAA GATT AGACA T TGAAATGAAA AAQGAAGCTT TGGAAAAATA 
1821 COGACATATG TCATITGnT GTGAGAGTTA CATGGTTTAC AAAAGGTTCG AAC CATTTA A AGGAGCEAAA 
1891 AAATTGAGAA CnTCTTAGC AATGCCTGTT GGGATGATAA AAAGTTGGAC AACATTTTAC 1TATCAAATA 
1961 AGGTCCTPGA TGACTTACTT CACGAATTAC CAirGTTGAG AGnTCTAAGT TIGAGTTATC TTAGCATCAA 
2031 GGAGGTACCT GAAATAATAG GCAATTTGAA ACACTTGCGG TATCTTAAIT TATCACACAC GAGTATC ACA 
2101 CATTTACCAG AAAATGTCTG CAATCTTTAC AACTTACAAA CATTGATCCT TTGrrGGCTCT 'l^JiTrr ATAA 
2171 CCAAGTTrCC CAACAACTTC TTAAAGCTTA GAAATTTACG GCATTT^AC ATTAGCXaVEA CTCCCGGTIT 
2241 GAAGAAGATG TCCTCGGGGA TTGGTGAATT GAAG AACCTA CACACYCTCT CCAAGCTCAT TATTQGAGGT 
2311 GAAAA-SySAC TAAACGAGCT TAAGAACTTA CAAAATCTCC ATG 
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RTGlb - Diana 
[Strarxi] 



1 TACTACTACT AGAATICGC?r GTTGGTAAGA CGAdXCTAGC TAGACTTTTG TATGAGGAAA TGCAAGQGAA 

71 GGATCA CTTC GAACTTAAGG CGTGGGTATG TGTTTCTGAT GAGTITGATA TCTTCAATAT AAQCAAAATT 

141 ATCTTACAAT CGATAGC5TGG TGGAAACCAA GAATPTACGG ACTTAAAOCT GCTTCGAGTA GCTTTAAWUS 

211 AGAAGATc^rC AAAGAAAAGa TTrCTTCITC TTCTTGATGA TGnTGGAGT GAAAGCTATA CeXtA TrS GGA 

281 AATTTTTAGAA CGCXXZATTTC TTGCAGGGGC ACCTGGAAGT AAGATTATTA TCACCACCCG GAAGCTGTC A 

351 TTOTTAAACA AACTCGGTTA CAATCAACCT TACAACCTiT CGGTTTIGTC ACATCAGAAT GCTITGTCTr 

421 T ATICICT CA GC ATGCAT TG GGTGAAGATA ACTTCAATTC ACATCCAACA CTTAAACCAC ATGGCGnAQS 

491 TATTCTZGAA AAATGTQATC* GeOTGCCATT GGGATTGICG ACATGA3GAT GATG' 
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[Stxanci] 



1 TCOCG TCCA A. CX3T:?rATCAT rPCAGAAQCC CCAAAGACCA HAGMNTOZT TMaXSSmSHrV imCAGAAGS 

71 AAGXAATTGTl TG^^AGCIGXN AAAAGAIGGC TGMTGATNrr GCAACAATTS GCTIAGGAC^ CZGANGSAOCl 

141 ACTTGATSAT NIOGCAACAG AAQCmTrcA TCGTQftGITG ATCCC3TGAAA CIGGAOCITC QCCAGOITS 

211 GTAAG^^AAGC TAATCCCAAG TIGTIGCAC\ AGTITCirCAC AAAGTAATAG GATOCftilGCC AGGTIAGAIX: 

281 ATOT TOC C G C TAAGItACAA GAACTQC3TAG AGGCX aftRAAR TAATCITGGr TTAAGXCnGA TAACATAOGA 

351 AAAACCCAAA ATTGAAAGAG ATGAGGCGITT TTTGGTAGAT GCAAGTGGTA TCATIGGACG TGAAGATGAT 

421 AAGAAAAAAT TGCTTCAGAA .GCTGnTGGGG GATACTTATC AATCAAGTAG TCAAAACTTC AACATCXSTOC 

491 CCATAGTTGG TATGGGTGGG GTAGGTAAAA CAACTCTAGC TAGACTTTTG TATGATGAAA AAAAAGTGAA 

561 GGATCAOTC GAACTCAGGG TTTGGGTTTG TGTTICTGAT GAGTTCAGTG TTCCCAATAT AAGCAGAGTT 

631 ATCTATCAAT CTGTGACTGG TGAAAACAAA GAATTTGCAG ATTTAAATCT GCTTCAAGAA GCCCTTAAAG 

701 AGAAACTTCA GAACAAACTA TTTCTAATAG TTITAGATGA TGTATGGTCT GAAAGCTATG GTGATTGGGA 

771 GAAATTAGTG GGCCCATITC ATGCTGGGAC ITCTCGAAGT AGAATAATCA TGACTACICG GAAGGAGCAA 

841 TTACTCAAAC AGCTGGGnTT TTCTCATGAA GACCCTCTGC ATAGTATAGA CTCCCTGCAA CX3TCTATCAC 

91X AAGAAGATGC ' iT l UlCiTlU TTITCTCAAC ACGCATITGG TGTACCTAAC TTTGATTCAC ATCCAACACT 

981 AAGGCCATAT GGGGAACAGT TTQTGAAAAA ATGTGGGOGA TTGCCTTTGG CLTIW 
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I anACCJTTC TACGAGATCG CTGTCCCTCC TCGATCTGCT TAACGATGCT TCCCAGAAGG AAGTWACTAA 

71 TCSAAGCOGTr AAAAGATGGC TCAATGATCT CCAACATTTG GCTTATGACA TANACGACCT ACTTGATGAT 

141 CTTGCAACAS AAAGCTATTC t3T CSTGAGT T GACCGANGAA QGTCGAC3CCT CCACCAGTAT GGTAAGAAVi 

2U CTAATCOCAA GITOnGCAC AAGTTTCTCA CSU^AGTTATR GGATGCATGC CRAGTEAGAT GATATTGCCA 

281 CCAGGTTACA AGAACIGGTA GAGGC AAAAA ATAATCTIGG TTTAAGIGZG AfCAACATATG AAAAGCCCAA 

351 AAT7GAAAGG TA TGRGGCA T CTXTGGTAGA CGAAAGIGGT ATnTTGGAC GITOAGATGA TTOWSAAAAAA 

421 TTGATGGAGA A^TGTIGGA GGATAAA6AT GAATCCGGAG TdlAAACITC AGCAXCCTGC CCATAATIGS 

491 TATQGS7GGA GTTGGCMAAA CAACICIAGC TAGACTCTTG TTTQATGAAA AGACAGTGAA GGATCACITC 

561 GAACTCAGGG CTTGGGTTTG TGTITCTGAT GAATTCAGTA TTCTCAACAT AAGCAAAGTT ATCTATCAAT 

631 ctgtgaccx;g ggaaaagaaa gagtttgaag acttaaatct gcttcaagaa gctcttagag ggaaactaca 

701 AAACAAACTA TTTCrAATAG TTTTCGATGA TGTATGGTCG GAAAGCTATG GTGATTGGGA GAAATTAGTG 

771 GGCCC-TTTC ATGCTGGGAC 1TCTGGAAGT AGAATAATCA TGACTACTCG GAAGGAGCAA TTACTCAAAC 

841 AGTTGGGTrT TTCTCATCAA GACCCTCTGC GTTGTATAGA CTCCCTGCAA CX5TCTATCAC AAGATQATGC 

911 rrm ' it. - irm tttgctcaac acggatttgg tgwcca 
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PLGIE 
[Strand] 



1 ICTMCTAGA CmTCTATG ACGAGATGCA AGAGAAGGAT CACTTCGAAC TCAAG GCGT G GGTTroTCTT 

71 TCTGATGAGT TIGATATATT CAATATAAGC AAAATTATTT TCCAATCGAT AGGAGG TOGA AA CCAAGAAT 

141 TTAAGGACrr AAATCTCCTT CAAGTAGCTG TAAAAGAGAA GATITCAAAG A AACGATITC TACTTGTT5T 

211 IGAIGAIGTT TGGAGTGAAA GCTATGC3GGA TIGGGAAATT CTQGAAOGCX: Ca.'lTi'L 'nU: AGGGGCAGOC 

281 GGAACTAAAA TTATCATGAC GAOCOSGAAG CAGICATIGC TAAOCAAACT CUtfm CAAG CAACCTIACA 

351 ACCTTTCCGT TTIGTCACAT GACaGT GC IC TCTCTTTATr CTOTCRGCAT GCA-nm aU AAGATAACTT 

421 CXSATTCACAT CCAACACTTA AACCACATGG CGAAGGCATT GTTGAAAAAT GTGCT 




wo 98/30083 



PCT/US98/00615 



59 



ELJ3IF 
[Strang 



1 ATTrra:GCT QIAAACAAAN AAAAGCAATG GCTGAAATCT TTCmraGC ATTCTAGACC AGTATTCITT 

71 GAAAAGNTGG CTTCTGAAGC CTTGAAGAAG ATCGCTCGCT TCCATCGGAT TGATTCTGAG CTCAAGAAAC 

141 TGAAGAGGTC ATTAATCCAG ATCAGATCTG TGCTTAATGA TGCTTCTGAG AAGGAAATAA GTGATGAAqC 

211 TGTTAAAGAA TGGCTGAATG GTCTCCAACA TrrGTCTTAC GACATAGACG ACCTACTTGA TGATTTCGCA 

2B1 ACCGAAACTA TGCATCGTGA GTTGACCCAC GGATCTGGAG CCTCCACCAG CTTGTAAGAA AGATAATCCC 

351 AACTTGTTGC ACSySATTTCT CACTAAGTAG TAAGATGCGT AACAAGTTAG ATAATATTAC GATCAAGTTA 

421 CAAGAACTGG TAGAGGAAAA AGATAATCTT GGCTTAAGTG TGAAAOGTGA AAGCOCAAAA CATACCAACA 

491 GAAGATTACA GACCTCTTIG GTAGATGCAT CTAGCATTAT TGGTOGTG 3 A GGZtSATAAGG ATGCAITCCT 

561 CCATAAGCTQ CTGGAGGATG AACCAAGTGA TAGAAACTTT AGCATCGflGC CAATAGTTGG TATCGGTSGT 

631 GTGGCjTAAGA CXSVCICIAGC TAGACZmG TATGACGA6A TGCAAGAGAA GGATCACITC GAACTCAAG8 

701 CGIGGGTITG TCTITCIGAT GAGTTIGATA TCTTCAATAT AAGCAAAGTT ATCTTCCAAT CGATAGGTQG 

771 TGGARACCAA GAATTTAAGG ACTTAAATCT CCTTCAAGTA GCTGTAAAAG AGAAGATTTC AAAGAAAOGA 

841 TTTCr.JYTTG TTCTGGATGA TOnTCGAGT GAAAGCTATA CAGAATGGGA AATTCTAGCA CGTCCATTTC 

911 TTGCAGGGGC ACCAGGAAGT AA GATIATC A TGACGACCCG GAAGTTGTCG TTGCTAACCA AACTCGGTTA 

981. CAATCAACCTT TACAACCTTT CSGTriTGTC ACATGATAAT GCTYTGTCTT TATTCTGTCA GCAYGCAITG 

1051 GCTGAAGATA ACTTOGATTC ACATCCAACA CTTAAACXZAC ASGGTGAAAG TATTOTTGAA AAATGTGACG 

1121 GTITACCATT GGCTTTRATT GCACTTGGGA GRTTGTTGAR GACAAAAACA GATGAGGAAG AATGGAARGA 

1191 AGTGTTGAAT AGTG AAATAT GGGGG TCAGG AAAGGGAGAT GAGATTGTTC CGGCTCTTAA ACTAAGCTAC 

1261 AATGATCTCT CTGCCTCrrr GAAGAAGTTG TTTGCATACT GCTCCTTGTT CCCAAAAGAC TATGTGTTCG 

1331 ATAAGGAGGA GTrGATITIG TTGTGGATGG CAGAAGGGTT TTTGCACCAA TCAACCACAA GCAAGTCBAT 

1401 GGAACGCTTG GGKCATGAAG GmTGATGA ATrOTICTCA AGATC A ' ITIT TTCAACATGC CCCIGATGCr: 

1471 AAATCGATGT TIGTGATGCA TGAOC TGAT G AATGACTTGG CHACAICTGT TGCTGGAGAT TTTTTTICAA 

1541 GGATC GACAT TCAGATGAAG AARGAATTTA GGAAGGAAGC TTTSSAAAAG YAYCGCCATA OGTCAimCT 

1611 TTGTGAKGAT TACATGGTKi; ACAAAAGGTT CRAGCCATTS ACAAGGAGCT AG 
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RffflG 
tStrani] 



1 GTGM-SGATC ACTTCGAACT CAGGGCITGG CiTilTlVX ' iT CTGATGAATT TAATATCCTC AATATAAGCA 

71 AAGTAATTTA TCAATCTCTA ACCGGGGAAA AAAAGGAGTT TGAAGACTTA AATCTGCTTC AA GAAGCT CT 

141 TAAAQAAAAA CTrTGGAATC AGTrATTTCT AATAGTTCTG GATGATGTCT GGTCTGAAAG CTATCGTGi^T 

211 TGCSSAGAAAT TAGTGGGCCC A TlTm ' rC G GGGTCTCCTG GAAGTATGAT TATCATGACA ACTCGG AAGG 

291 AGCAATTOCC AAGAAAGCTG G UTI T IXJ CTC ATCAAGACCC TTTGCAAGGT CTATCACATG ACGATGCTTT 

351 U ' lCri ' lUTlT GCICAACAOS CATTIGGTCT ACCA 
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KDSIH 
[Stzarxil 



1 TCTAGCTAGA CTITICTATG AGGAAATGCA AGGGAAGGAT CACTTCGAAC TCAAGGCGTG GGTATGTGTT 

71 TCIGATSAGT TTSATATCTT CAATAT2UU3C AAAATEATCT TACAATCX3AT AG G TG G TOGA AACCAAGAAT 

141 '1TALX;C IACTT AAACCTGCTT CAAGTAGCIT TAAAAGAGAA GATCXCAAAG AAAAGATTIC TVj n XJfl'iX^V 

211 TGATGAIGIT TGQAGT6AAA GCTATACOGA TTGGGAAATT CTAGAAOGCC CAmCTTGC AGGGGCAGCT 

2B1 G GAAGTZAAGA TTATTATCAC CACCCG6AAG CIGTCATIGT TAAACAAACT OGGTCACAAT CAACCTTACA 

351 ACCTTTOGGT TTTGTCACAT GAGAATGCTT TGTCTTTATT CTGTCAGCAT GCATIGQGTG AAGATAACTT 

421 CAATICACAT OCAACACTTA AACCACATQG CX^AAGGTATT GTIGAAAAAT GIGAT 
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RD31I 
[Strand] 



1 TTO^GCTAGA CTTGTGTATG ATGAGATGCA AGAGAAGGAT CACTITGAAC TCAAGGCGTG GGTATGTGTT 

71 tctg;^tgagt ttgatatatt caatataagc aaaattattt tccaatcgat aggaggtgga aaccaagaat 

141 ttaaggactt aaacctcctt caagtagctg taaaagagaa gattttaaag aaacgatttc ttcttgtrgt 

211 'tcacgacgtt tggagtgaaa gctatgcca^^ ttgggaaatt rttggaacgcc catttcitgc aggggcagoc 

2b1 ggaagtaaaa ttatcatgac aacccgaaag cag tcat igc taaccaaact cggttacaag caaccttaca 

351 ACrrrrrCCGT TrrGTCACAX aACAGTGCTC TGTCTITATT CTGTCAGCAT GCATTGGGTG AAGGTAACrr 

421 CGATTCACAT CCAACACTTA AACCACATGG CGAAGGCA1T GTTGAAAAAT GTGCrGGATT GCCATIQGCA 

491 TTCrrCGACA 
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[Strand] 



1 ThCmrr?£rT AGAATTCGGT GTTOGTAAG;^ CXSAglCraG C TAGACmrS TATCAC GAAft. tgcaagggaa 

71 GGATCACrrc GAACITAAGG CGTQGGZATG TGTnCZGAT GAGTrTGATA TCTZCAATAT AAGCAAAATT 

141 A'iT-TrACAAT GGATAUUIGG TGGAAACCAA GAACTTACGG ACITAAACCT GCTTOGAGTA OCTTTAAAi^ 

211 AGAAGAT^ AAAGAAAAGa ITiCrrCI 'I G TICTIGATGA TCmCGAGT GAAAGCIATA CCGATIGGGA 

281 AATIOTAGAA CGCCCATXIC OTGCAGGGGC ACCTGGAAGT AAGATTATTA TCAOCACCCG GAAGCZGTCA 

351 TTCITAAACA AACTCGGTTA CAATCAACCT TACAACCTTT CGGTTTTGTC ACATGAGAAT GCTTIGICIT 

421 TATICTGTCA GCATGCATTG GGTGAAGATA ACT TCAATIC ACATCCAACA CTTAAAOCM ATQGCGlAaS 

491 TATTGTTGAA AAATGTGATG GaETTGCCATT GGCATZCTCG ACATGAT6AT GATQ 
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I\m/RTR?LSLLHLLSYVIFS?I?PH?ILWLF.INFYSTCHFMSFSILLSFT.YLNVITINAYLFFFK.THI1YR 

LKSYNTVKU.YICSSFVYLYVSSUYLLHY.SR.SLY.KFNLFKI.NY..SHNLNK1KKNGPTISPSLFQUNIV 

SILmFHPNQYFQRI^SYGVSEFAFRHCSLKBINQMELLQCSliWKGELYVK?MSAl?l>lPEPTrLSV 

YHQTTQNGGSR?T?KS.RlDYFCPHGLTEERV.RlFL?KNYRSlEFLHRQRSFTSQCFVKQFUFLSFaNS 

S1ATCNLGLLGPQICGGR.FNPHIHCKQ.FKSISVHPIHQHLU1EI1HASS1SSTSILYSLLLSY.TMAEIVLS 

AFLTWFE<LA?EALKKI\mSKRIESELKKIJ<EriJX3IQDLLNDASQKE\n-NEAV^^ 

D1JJ3D?ATEAV?RELTEEGGASSSMVRKUPSCCTBFSQSNRMHAKLDD1ATRLQELVEAKNNLGLSV1 

TYEKPKlEHYEASLVDESGWGREDDKKKLLEKLLGDKDESGSQNFSIVPIVGMGGVGfCrrU^ 

KNMDHFELFJAWVCVSDEFSVPNISRWQSVTGEKKEFEDLNLLQEALKEKLBNQm-lVm 

GDWEKLVGPFUGSFGSRIIimRKEQLLRKLGFSHQDPLEGLSQDDALSLFAQHAFGVPNFDSHPTLR 

PHGEU=VKKGDGlPULHTLGRLLRTKroEEQWKElJJJSElWRLGKSDEIVPAmL^^ 

YCSLFPKDYEFDKEELlU.WMAEGFlJraPT?NKSKQRIjGI.EYF7B±SRSFFQHAPN?KSL^ 

LATFVAGE?FSRIJDie;lKKEFRM?SLEKHRHMSFVCE?YIGYK?FEPFRGAKNLBTFL^VGWEDWK 

MFYLSNKVLND?LQDLPLLRVL?L1?L?I??VP??VGSM?HLRYLNLS?T?ITHLPE??CNLYNLQTL1V 

SGC?YLV?LPiaFS?LKNL7HFDMR?TP?LKNMPL7IGELK?LQTLF?NIGlAITELKNL7NLHGK?CIGG 

LGKMENAVGCTLSELVSKKV?.??NW??G..1.CFPKWEHLKKKSSMK.CUMVL?KKP?IMS1GG1EFPN 

WVGSLRVSETRDVR.1VYEK?CFT.FHQSPSGK.MiFSG?TDEMWRGMIG?LGAVEElSIHSCNEIRYLWE 

SEAEASKVUWlNLKKmLGECENLVSLGEKKEDNHNII^SSLTSFRRlJMWR 

MHMCDS?TSVSFPTGGGQI<IKSLTrroGKKLSEEELGGRERTRVUNSKMQMl£SVDIR^WNLKSlSB. 
SCFIHLNRLYISNCPS7ESFPDHELPNLTSLTDRRRGQRFSYERLRFDWPSF 
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NRSYENRCPLLPVI...EKI.LKV.IQNPLFHR.YDALAWCKNSIINANQNSS.ET1.IMV.IVLSPYTHFFQIPII 

HTYKCSHIRFSUVMAEILGSAFFAVFFEKLASEALKRVACSKVIDKELH<LJ^SSJNIKA^ 

KEAVKEVVU^ALQHU'YDIDDlXGDU^TKAIHRKFSEEYGATINKVRKUPSCreSLSSTKMRNKIHNITS 

KLQELLEEaNNLGLCEIGESRK!J?NRKSEre?LDPSSlVGRTDDKEALiiJ<LYEPCDRNFS 

DiaTLGRLLYD?MQVKDHFELJCAVVVCVSDEFDIFGISICnFESIEGGNQEFKDL^LLQVAU^ 

WmDVWSESYTOWEll^RPFlAGAPGSKVIITTRKlSIJJ^QLGHEXaPYQl^DI^ 

VNSFDSHPIlKPHGEGIVEKCDGLPl^lAmRIXRTKRDEEEWKEU^ISEIWRLGKro 

l^U<QlJ=AYCSLFPKDYVFNKEmI±WMAEGFmNE^mlKSMEFL?L^^ 

LFWHDUvlNDL^TBVAGDYFlJlLDlEMKKEALEKYRHMSFVCESYimKRFEPFKGAKK^^ 

GMIKSVmFYI^NKVIJ3DLJi4ELPLLR\rt^l^YL5IKB^PEUGNU<HLRYmLSHTSm 

LQTULCGCX)FITKFPNNFIJ<LRNLRHIJJISDTPGLJ<KMSSGIGE1J<NLHTI^ 



H 
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SRAT?1IQK?PKT?D?F????QKEV1DEAVKRWLID?QQLAYDT?D?LDD?ATEAIHREURETGAS7S 

MVRKUPSCCTSFSQSNRMHARLDDIAAK?QELVEAKNNLGLSVITYEKPKERDEA?LVDASGIIGRED 

DKKKLLQKLmDTYESSSQNFNlVPIVGMGGVGHnTlj\RLLYDEKKVKDHFELR\WVCVSDEFSVPNI^ 

RVlYQS\/TGENl<EFADmiXQEAlJ<EKLQNKLFUVLDDVWSESYGDWEKLVGPFmGTSGSRllhm^ 

KECaiXCLGFSHB^PLJHSIDSLQRl^QEDAI^LFSQHAFGVPNFDSHFnJ^PYGEQFVKKCG^ 
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?T7mDRCPSSlCLTMLPRRK?LMKPU<DG.MISNIWLMT?TPi'LMILQ?KAI??ELT?EGGASTSMVRK 

UPSCCTSFSQSYRMHAKLDDIATRLQELVEAKNNLGLSVrrYEKPKiERYEASLVDESGlFGR7DD?KK 

LMEKLLH3KDESGVKLQHiJ=llGMGGVG?TTlJ^Rl±FDEI<T^ 

VTGEKKEFEDUvlLLQEAmGKLQNKmJVlIDDXMSESYGDVVEKLVGPFH^ 

QLGFSHQDPLRCIDSLQRLSQDDALSLPAQHAFG? 




i 



wo 9800010 ^ Kt«w»mis 



URaYDB/QEKDHFaJ<AWVCVSDEFDIFNISKIIFQSIGGGNQEFKDmLLQVAVKEKISKKRFLL^ 

D\AWSESYADWEil£RPFlAGMGSKIIMmKQSLLTKLGYKQPYNLSVI^HDS^ 

DSHPTLKPHGEQIVEKCA 
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FSA?NK?KQWU<SFF?HSRPVFFEK?ASEAU<KIARFHRIDSElJ<KlJ<RSUQlRSVmDASEKEISDEA 

Vi<EWLiJ6LQHLSYDDDLlJDDLATETMHRELTroLEPPPAa<KDNPTCC^ 

QELVEEKDNLGLSVKGESPKKmRRLQTSLVDASSIIGREGDKDAULHKLLEDEPSDRNRIVPIV 

VGICrrLARLJ.YDBkrtQEKDHFEU<AWVCVSDEFDIFNISKVlFQSlQGQ?QEFKDLNLlJ^ 

FL?\rtJDD\MSESVTEWOU\RPnJVGAPGSKIIMTTRKLSIimGYNQPY^ 

LGEDNFDSHPTlXPTGESIVEKCDGIPLAUALGRLLTTKroEEEWKEVmSElWGSGKGDEIVPALK^ 
YhroLSASU<KLFAYCSlJTKDY\TOKEEULl.VVMAEGFL^ 

AKSMFVMHDU«^DIJ^TSVAGDFFSRMD1EMKKEFHKEAL?K?RHMS7VC7DYMWKR 
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VI<DHFE!J=IAWVCVSDEFN1I1JISI<VIYQS\/TGEKKEFEDLJ4LLQEALKEKLWNQ!J^ 
DWEKLVGPFFSGSPGSMIIimBKEQLPRKLGFPHQDPLQGLSHDDALSLFAQHAFQVP 
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URI±YEEMQGKDHFELKAVWCVSDEFDimiSKIILQSIGGGNQEFTDmLLQVAU<EKlSKKRRXVm 

DVWSESYTDWHI^PRAGAPGSKIII77RKl^lJJ^GYNQPYNL5\rt^HENA^ 

SHFRKPHGEGIVEKCD 
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LARLWDEMQEKDHFELKAWVCVSDEFDlFWISKIIFQSlGQGNQEFKDLNLlQVAVKEKilJ<KRR±^ 

DVWSESYADVVB?ERPFUGMGSKIIfc/nTRKQSllTKLGYKQPYNI^VLSHDSALSl^ 

DSHPTLKPHGEGIVEKCAGLPLALST 
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EFGVGHTnJNFUlYEEMQGKDHFELKAWVCVSDEFDIFNISKllLQSIGGGNQEF^ 

KRFLLVLDD\WSESYTDWB?ERPRJ\GAPGSKIirrrRia^LIJ^KLGYNQPYNL5VL^^ 

ALQEDNFNSHPTLKPHG7GIVEKCDGLPLALS 

5^(5 SO ^O'ZO 
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1 TTNACACCAT AAATTCTCNA CCTGNGGGGA CAAAAACCTA AAAATGGTCC ATAATGCNCA AATCA GNAAG 
71 GTTGANAAAG CTCTAAGTTT TINACCTCCA NCTGATGCNC NNTCCTCOTA AAGTTCANAT CCAAGCTTGC 
141 CCTCCAACTC TAIiCNCCTTC AATGGCACCT CCTTCTCTTC AAAAGCACAC AAGAACACIT TCJAGCT^ 
211 CCACACICAC ACAAGCTCTA GAACNAGGGT TAGGGCACAT TTAGGGTnT (^TCTCUGGh AATGGTGTCT 
281 AAAAGTGAGG CCATAATGTT CCTTATATAA GGCTCACTCC CACAATTAGG CnTCAATCT GAACGTAITrA 
351 CGCCCAGTGT ACACTATGGT ACGCCCAACG TACTCGGTAG TCTCCGCGTC AANAA TACAC TC ATSAGTA C 
421 GCOCAACGTA CTITCCCTTA CGCCCAGCGT ACICAAAAGC CAAACATTCT TITCAAGGAC TAATTITCAC 
491 AACTTGAGGA AAGAAAAGGA TCAAAGANAT ATACTTGAAT TCCGGGATGrP TACAATGAAG TTGANACCTT 
561 GGCTAAAAAA TTAAATTQGT TGTGGAAGCC GTTGGCTGAG CAAGCAACAA G GGTAAAATT CGTAA TCTAC 
63X AAATOGrrOTT ATmCTATT TOITCITA TT ATmACITG ATTTROSGGT AGT mTl' lT TCTEACAAAA 
701 AATATTAAAG TTGATAAAGT ATAGCCACIA AAATTGACIT TTTCCAAAAC ATAATGTCAA ATGCTTGCGTA 
771 TATGTATCAT GTIGTATTAN ATAATGAATA TGATGATNCT GTTCTATrTA ANC CGAAAAA ATTATCTAAT 
B41 GA TITIA TAT TGGAAAACAA AGTTGTGATT TTINGCATAA TATAATCAAA TCCNCTmC TNTGGGAGGT 
911 GGATAAATGT GGTAAATTTA NAACAAGTGT TTTNACNTTG AAGGGTITTrGG AAAGG TTSAA AAAAGT TAAA 
981 ATGATAAAAT GTTTACACAA ATGTTGTATC CGACTGAATA IWATGrTTTAA GGA1NATTGT ATTAAATTGT 
1051 TGATATATAG TAAGCATAAA TATTTAGAAT TGTGACTTAA ATTTATAAGT TATNCNAACT GGATTGAAAC 
1121 ATTTITCATA TAIOATTAGGA ATGAAAATGA GCAACCCTAA CATACTTATC nTGG TAGTT TGGTTATTAT 
1191 ATTTrrATTA NAATATAGAA NCATC CLTIT ATTTTAAACC CATATTGTGG ACGGACITGA ATAAATGGGA 
1261 AAAATGTACC TTGCTATTTA GCACAAAAAA A1TATAAAAA TGTACATIGC TATITAGCAC AAACAAAAAA 
1331 AAAAAACTTA ' lU;mTlUC ATTAGGTCAC AAAGAAATAT AAAATGGGAA ATGTGTTGCT ATITAATCCA 
1401 CTAAAAGAAA CTATTITGCC TTIATTAAAC CGGGTAAACC AATAGAAAAA TGGAAGTACA ri CriCATT TA 
147X GCATGAAAAA AAATAACTIT CCATTTTITG CATCCGGTCA CAATAATAGA AAAATGAAAG TAOOTTOCTA 
1541 TTTAGCGAAA CTRACtTCCT ' ITmcmT TGQCATCGTA TCATAAAATA TAGACTAAAA TACGTTAGTr 
1611 TTACATTTTr AATACATTGA AATGTCTAAT CCACATGTTA TICTATAAAA AGGGAAATGT AATrTACITA 
16B1 TTCnTSATT CTl ' lUJCi ' lV TmTAGTAC CCAAAACATC CCTCTATCCA TCTATTCCAA CTA AAATAAT 
1751 GAAAACTATA TTCCTTCCAT TGTAGGGAIG TTATAAATTT TGTAATIGTT TTTA TGCAAA AAAGTGTriT 
1821 TrGTTAACTA GATTAACGAG ATTCATnTT CAGCATTTTA GGAGAAGTTC ATCCATCTTT TGGATATGAA 
1891 GTGCAAGCCA A GTIUI ' IT AA CATGGAATAT GAGGTCCCTA T ATGCT CAAA AAATAGCAAA TGAGAAATTT 
1961 TTTAAArrGG ATCCCCATAA AAGAAAATTT GTTAATGGTT CmTAATAT TGG TCAATG T G TCCACCG GA 
2031 TGAGCA"rAAT ACTAGTTTAT AAGGGGXAAA GGTOGGTTTG GTGGGCCCAT TTA'i Um'AT T^TTCTAAA 
2101 AGTCAGAATT AAGTAAAAAA AATTATAAGA TAAATACCAT AAGGATAAAA AATC ATTTTA TTTG GACCAA 
2171 AGACCAAAGT TGTTAAGGGG CiU'ITm ' nT TmTGTCAA G AGCIG TGCA ACCACTTTTG TCTGCGCCGC 
2241 ACAGACAACG TGCAGACATA TGCCCTCGCA GAGTGTITGT TnTTGAAAG TGCGCAGACC AAAA AAACGT 
2311 CTGCGCGAGG TCATCCTGGC GCATATATGT GTCACTGTCT TCAAAGGTCT TCAGACCTCA TTTTAACCAA 
2381 AAAAAAAAAA GACCACCGGT rXTi'mTrr TnTl t Tnvr TTCTCTTGTA GCTGAAAATG CATTnTAAT 
2451 CTTTATGACA TGAAATTAAG TTTGAAAAAT TAATTTATTT CAACAGCTGT AGACGTTAAA AACAAACAGT 

2521 cnmvrm cagactgtgg acatttggtc cacctcttct accgcagaga cttgcagatg tggtccgcag 

2591 ACTGCAGACA TTTTOGCTrC AAATAAACAA ACATCACCTA ATTTGACTAC ACCACACGGA CCTCCAATGT 
2661 AACAAAAAAA AGGITGAAAC AAAGTTGCCrr ATTTCTCCAT ATCCAGGGGC CA TITATG TA AGAGTTATCT 
2731 AAATmAGT TCGGTAGATC AGITCTCACA TTTTAACCGG GTAAAGTGTA TC'IGTOTACG CGCGC A CCTG 
2801 AAAGCmGA ANGTAACTTC CAAACIGAAN CAANAATCGA TATG&AGTAT CAAGTTAGAG GTTCAATOQG 
2871 TGAAGGAATC AGCTGGAGGT TGGQGRATCG AGCTTCCACT ATTRAGGTAA AATCCATAAC C CTAAA TOTT 
2941 GGTACCCICA TATATCAAAT ' iUK i UVm ' GTTGAATGAA AAAAG CATGC TCAAAAAACC AGTGTAAGGC 
3011 ACGGXATATG ACATATTTAT AGTTACTGAT AACAAATTAT GATAATTTTG GGTITACXSTA AGTTAG GATT 
3081 CGTACTTCAA CCAAATGTAA TAGTnTrGT GAGTC TATC T ATGTAriTGG GGAATCACAT TAGCAA CGQG 
3151 ATTGTACTAG TAATTCGAAA AAGTCTTTTA AATAATnTT CTGTTTATAA TTTATGAATA GOTTAGCGA 
3221 CATCTAATAT TAAATAGAAT GTATCTGATA TTGAATIAAT UnTXT rAATG TGAAC ATAGA CCTTTTCCAT 
3291 TTACTAATGC CTAATTATTA GTITCTAATC AATAAATTTT AATITCTGTT TTATGCTTCT AAGACAATAA 
3361 AAATCCATGA TTTACCTTTA AATATTAACA AAAATGACCA T AAATAA ATA AAAAATTAGG ATA CCAAA OC 
3431 CCCCCCXXAT GCCCAATGTC TAAATATTCT TGATGCTTIT GCTOTICCCT CmTOTTG TTA GTCIATT 
3501 ATTCrGSAGA GTITGAGAGA GTITCATACA AGAAAATTTC AAGAAGAAAG CAAAGGTCC A GGTAriCT CT 
3571 TTTCTXAATT ATSTATTAAC TTACAAGCAT TTTTTACACG ATCCATGGTT TTlTCrGTAT UTl'miA AA 
3641 TTGAAACTAG ATIGGGACPr TTGCCCTTGA TGATICATAA GATATTGCAT GGAC3TTSAGA TTGTGTAAGA 
3711 AAAGTGGTGA ATAGAAA6AG CAAGIGAATC CAGATATAGT ATTGGTAATA TATGAT GATG AGATAGAGAT 
3781 ATGTIAAAAC TGGCTAGAAA ATTGTrTTAA TTTGAAATTT AGGTTGTTGA ATTTGAAAG A TAC CAAGCT A 
3851 ATAACEAATT AGTTATGCTA AATAGTTATA AAGAACAACA AACTCGTAGT TlTl TlTiCA TGAITITCAA 
3921 CCICTTCGTA CCAAACTAAA TTATAACAAA ATTGAATATC ATTCTCTGCA ATCAATTTTA ACTmCTTA 
3991 TTATCATCAT GTCTAAAATT GCCACAAGTT TATTTTCATA GTCATATTGG ATTATGAAAG GACTATTTTT 
4061 ACCAATTACA TCTTTACnT ATGGCCAAAG CTAATACAAT CCGACTAAAC TAAAGGATTC TAGGATGCAT 
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4131 ATAGTrrGCr CCCCGATTAT AGATTTCTAT CTAATITCTC TATTGTACTA ATTTAGGTGC CACCACAAGT 
4201 AAATTCCTGA AATCGATGTC GTTAATCCCA TTCTTAAACC AGTTCTCGAG ACTCTCATGG TACCCGTTAA 
4271 GAAACACATA GGCTACXrrCA TITCCTGCAG GCAATATATG AGGGAAAIGG GTATCAAAAT GAGGGGATO3 
4341 AATGCTACAA GACTTGGTGT CGAAGAGCAC GTGAACCGGA ACATAAGCAA CCAGCITGAG GTICCAGCCC 
4411 AAGTCAGGGG mSTZTGAA GAAGTTAGGAA AGATCAATGC AAAAGTQGAA AATITCCCTA GCGATCTTQS 
4481 CAGTIGTTrC AATCTTAAGG ITAGACAOSG GGTOGGAAAG AGAGCCTCCA AGATAATTGA GGACATOGAC 
4551 AGIGZCAOGA GAGftACACTC TATCATCATr TGGAATQATC ATICCATICC TTEAGGAAGA ATTGATXCCA 
4621 CGAAAGCATC CACCtCAATA CCATCAACC6 ATCATCAIGA TQAGTTCCAG TCAAGAGAGC AAACTTIGftC 
4691 AGAAGCACTA AACGCACTCG ATCCTAACCA CAAATCCCAC ATGATAGCCT TATOGGGAAT GGGCC3GAGTG 
4761 GGGAAGACGA CAATGATGCA TCGGCTCAAA AAGGTTQTCSA AAGAAAAGAA AAXtSTTTAAT TTTATAAtTCS 
4831 AGGCGGTTCTr AGGGGAAAAA ACAGACCCCA TTGCrATTCA ATCAC3CTGTA GCAGATTftCC TAOCTTATAGA 
4901 GCTCAATGAA AAAACTAAAC CAGCAAGAAC TCAGAAGCTT CGGAAATGGT TTGTCGACAA TTCTGGTQGT 
4971 AAGAAGATCC TAGTCATACT CGACGATGTA TGGCAGTTTG TGGATCTGAA TGATATTGGT TTAAGTCCTT 
5041 TACCAAATCA AGGTGTCGAC TICAAGCTGT TGTTGACATC ACGAGACAAA GATGTTTGCA CTGAGATOGG 
Sill AGCTGAAGTT AATTCAACTT TTAATCTGAA AATGTTAATA GAAACAGAAG CACAAAGTTT ATTCCACCAA 
5181 TTTATAGAAA TITOSGATGA TGTTGATCCT GAGCTCCATA ATATAGGAGT GAATATTGTA AGGAAGTGTC 
5251 GGGGTCTACC CATTGCCATA AAAACCATGG CGTGTACTCT TAGAGGAAAA AGCAAGGATG CATGGAAGAA 
5321 TGCACTICTr CGTITAGAGC ACTATGACAT TGAAAATATT GXTAATGGAG TTTITAAAAT GAGTTACGAC 
5391 AATCTCCAAG ATGAGGAGAC TAAATCCACC TTnTGCTTT GTGGAATGTA TCCCGAARAC TTTGATATTC 
5461 TXACCGAGGA GTIGGTGAGG TATGGATGGG GGITGAAATT ATTXAAAAAA mGTATACTA TAGGAGAAGC 
5531 AAGAACCAGG CTCAACACAT GCATIGAGCG GCTCATTCAT ACAAA XVitaT TGATGGAAGT TGATSATGTT 
5601 AGGTGCATCA AGATGCAT6A TCTrGTTCGT GCTTTTGnT TGGATATGTA TICTAAAGIC GAGCATGCIT 
5671 CCATTCTCAA CCATAGTAAT ACACTAGAGT GGCATGCAGA TAATATGCAC GACTCITGTA AAAGACTITC 
5741 ATTAACATGC AAOGGTATGT CTAAGTTTCC TACAGACCTG AAGTITCCAA ACCTCTCCAT TITGAAACTr 
5811 ATGCATGAAG ATATATCATT GAGGTTTCCC AAAAACITIT ATGAAGAAAT GGAGAAGCTT GAGGTTATAT 
5881 CCTATGATAA AATGAAATAT CCATTGCTTC CCTCATCACC TCAATGTTCC GTCAACCTTC GCGTGTnCA 
5951 TCTACATAAA TGCTCGTTAG TGATGTITGA Cr OCTC riVr ATTQGAAATC TCTCGAATCT AGAAGTGCIT 
6021 AGCTTTGCTG ATTCTGCCAT TGACCGGTTG CCITCCACAA TCGGAAAGTT GAAGAAGCTA AGGCTACTG5 
6091 ATTTGACGAA 'ITCITATGGT GTTCGIATAG ATAATOGTGT CTTAAAAAAA TTQGTCAAAC TGGAGGAGCT 
6161 CTATATGACA GTGGTTGATC GAGGTCGAAA GGCGATTAGC CTCACAGATG ATAACTGCAA GGAGATGGCA 
6231 GAGCGTTCAA AAGATATTTA TGCATTAGAA CTTGAGTTCT TTGAAAACGA TGCTCAACCA AAGAATATGT 
6301 CATTTGAGAA GC7ACAACGA TTCCAGATCT CAGTGGGGCG CTATTTATAT GGAGATTCCA TAAAGAOTAG 
6371 GCACTCGTAT GAAAACACAT TGAAGTTGGT TCTTGAAAAA GGTGAATTAT TGGAAGCTCG AATGAAOGAG 
6441 TTGTTTAAGA AAACAGAGGT GTrATGTTTA AGTGTGGGAG ATATGAATGA TCTTGAAGAT ATTGAGGTTA 
6511 AGTCA-TCCTC ACAACTTCTIT CAATCTTCTT CGTTCAACAA TmAGAGTC LTlinwriT CAAAGTOTQC 
6581 AGAGTTGAAA CACITCTrCA CACCTGGTGT TGCAAACACT TTAAAAAAGC TTGAGCATCT TGAAGnTAC 
6651 AAATGTGATA ATATGGAAGA ACTCATACGT AGCAOGGGTA GTGAAGAAGA GACGATTACA TTCCCCAAGC 
6721 TGAAGTTTTT ATCTITGTGT GGGCTACCAA AGCTATCGGG TTTGIGCGAT AATGTCAAAA TAATTGAGCT 
6791 ACCACAACTC ATOGAGTIGG AACTTGACGA CATTCCAGGT TTCACAAGCA TATATCCCAT GAAAAAGTIT 
6861 GAAACATTTA GTTTGTTGAA GGAAGAGGTA AATATAAATT TTTAATGCTA ATACATTACA AAGGATCTTT 
6931 TCAGTTAAAT CTITCAAAAT ATATTGTAAT TTGATrGTAT GGGGTATTAT TGTTGGATGG GACTATTAAT 
7001 AAATGATTAT CTTGCAGGTT CTGATTCCTA AGOTAGAGAA ACTGCATGTT AGTAGTATGT GGAATCTGAA 
7071 GGAGAT ATGG CLT mCGAAT TTAATATGAG TGAGGAAOTT AAGTTCAGAG AGATTAAAGT GAGTAACTGT 
7141 GATAAISCTTG TGAATITGTT TCXXjCACAAG CCCATATCTC TQCTGCATCA TCTTGAAGAG CTTAAAGTCA 
7211 AGAATTGTGG TXCCATTQAA TCGTTATTCA ACATOCATZT GGATTGTCTT GGTGCAACTG GAGATGAATA 
72B1 CAACAACAGT GGTCTAAGAA TTATTAAAGT GATCAGTIGT GATAAGCTTG TGAATCICIT TCCACACAAT 
7351 CCCATGTCTA TA CTGCAT CA TCTTGAAGAG CTTGAAGICG AGAATTGTGG TTCCATTGAA TOGTIATTCA 
7421 ACATTGACTT GGATTGTGCT GGTGCAA1TG GGCAAGAAGA CAACAGCATC AGCTTAAGAA ACATCAAAGT 
7491 GGAGAATTTA GGGAAGCTAA GANAGGTGTG GAGGATAAAA GGTGGAGATA ACTCTCGTCC CCTTGTTCAT 
7561 GGCTTTCAAT CI CTTGA AAG CATAAGGGTT ACNAAATGTO AGAAGTITAG AAATGTATTC ACACCTACCA 
7631 CCACAAATTT TAATCTGGGG GCACnTTGG AGATTICAAT AGATGACTGC GGAGAAAACA GGGGAAATGA 
7701 CGAATCGGAA GAGAGTAGCC ATGAGCAAGA GCAGGTAAGG ATnCAATIT CACTGTCTTA ATTAATGATT 
7771 AAGCTCCTGC TTnTGAATA AAAAAGGGAC AAACCATTTC ATGACTTAAT GTAGCAATAC AAGTCATGTA 
7841 TAAGAGTGAC CAACTCTTTT TTATTTATAA AATGACTACA AAATATITTT TTTCATTAGA GATCATGTAT 
7911 AAATGriGACT AATITITCAT CACCTAACTT TAGTTGATAA A ICTITA TAA ATGTXyVCTAG TTACTTTICA 
7981 GTAAAATAAC AAATTTAATA AATTATCAAC AAAAAGCATC AACTAAAAAA ATCCCACAAC CCGTAATAAT 
8051 TTAAAATAAA AGGATTTAAC ATCXAATACXS AACAATmT TTTCTAAACA TGATTIGGAC CAAATATCAC 
8121 CAGCAACTCA AGnTGGAAT C GATTC AGCT TAAAACTTG A CTAGCATAAT TAGATAGATG AGAGTTGAAG 
8191 CTAAAGfTGCC TATATAAGTT CXjTTTCATCT TTTTTCTIGA TCITGATAGC AAGTTGAATG ATITTCTTCT 
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«2«1 TCWUUaTOA TAAAAATCTA CATTATAAAG AGACTAGCTT GAAAAAAAAT GgTCTAGGTG GCTOTGGCT 
Ji« tS^T^St GA^TGCSAA GGGGAGAGTA TCATTICRAA GACACAACAC ATCCnCATT T TATrrATIT 

IS^^I SStTO ATATCTIGCr CATATTIGTr ACAGATATGT CSAGGTCTATT AATCIOTV 
M?i ^A?S IaSSS ACATAAATCA GAAAATTAAA TAAAGAATAA AITAMAAGG GCfiCAMOCT 
B5U CTTT^CGT AAGACAAGGA CCAAACACGC AACRAAAATA AACAGTAGGG ACCATOCGAT ™«*»*«* 
BSll TAATTMSGA CCAAAAACAT AAATICCCCC AAACCATAGG GACCATICAT GT AATmCT CmCTXTIC 
868^ CTTT^^ OTGGSa ACrArmTT TICTACACAT CXAGGTRACG AACTIGTIGA AUlUTiUCA 

SSlCT GACCTACIAC AACCGATCAT AATW3TCA1A TGIGRACACT tCCAAajACT T^™CTm 
rail S^CA AAAAAACAAT AGITACCATG ATOTGAACAT ACTGAAAAAT TAATTAOOT AGCAACTECT 
M91 TTOCCATTT AGCTIGTATO OAAACAGTIC CGTCABACCXS TCACPIGGAT GGTAGATAAA TTTAGTOAAC 
9«l ^A^rc AMTAACCTA CCmTICTr ATTAACICAA TlTCAACCrA AATTCTGATr CTronTGAA 

I^^Sc SotIT TOTATTATC •ITOITCCATA GGATCCTTAG CATCTTITAA TAATTTATrr 

Moi ^^m^ gmSsa •mrrAATCT crrroGCATrr tccatcattt GCAACiGirr cttgaaaaaa 

9m SS^S^ AATC^A ACCATmCA AATCCAAAAT TATAAGAGAG AATTGTAAAT GGACATCGAA 
l»l ^^^^ tScACT TCAOIAAACA AOrrGCTAAT TACATTTCTr GCTOIGCAGA TIGAAATTCT 

SaS^ aagaagccac tcacagisvtt tctaatgttg tat tcccatc ctgtctowg 
«ai cAcic-rrrc ataacctcca gaaacttata ttgaacagag ttaaaggagt ggaggtgoio TroaGms 

«S1 AGAGIGAGAG TCCAACAAOT AGAGAATTGG TAACAACrCA CCATAACCAA CAACAACCIA •mTRCTIOC 
9521 S^CTCCAG GAATTCATIC TATGGAATAT GGftCAACATO AGlCATGTGr GGAAGTGCAG CAACTCa«T 
«M AAA T '^- "iU V CICTTCCAAA ACRACAATCA GAATCCSCAT TCCACAACCT CACftACCATA AAAATTATGT 

9 si ^^^S^ cmiSrac ' nvn - nvu. ctctcatcgc AGAAcncrr tccaacctaa agcatatcaa 

9731 GATAAGAGAG TCIGATCGTA TTCGAGAAGT TBTTICAAAC AGAGATGATS AGGKIGAAGA AATCACTACA 
9M1 -rTACSOTA CXXaCaCAAC CACCACTPTG TTCCCTAOTC TTGATrCTCT CACT CTAAGT TTCCTGGAGA 
987X ATCTGAAGTO TATTOGflGGA GGTGGTCCC3V AGGATGAAGG GAGCAATGAA ATATCTTICA MAATACOC 
9941 TOCAACSACT UCIX.T IVLI' U ATCAATTTGA GGTATGCrTT GTACATATTC AATIATTTAT TT^OTCCT 
lOOUTTmiTITC CAATATTCTA TAAATAATAC ATTITATACC CACTATACTA AGAT^^AAT ^AGAGG 
lOOfll GATCGATCCT ATCACACAGC TGCTACACTT CAGAAACTCT AGTAAGGGCA GTTATGGAAG TTCAATAAAA 
lOlSl TCATAATCGC ATCTTITCAT GGGTAATATA G GCAATTTAA GnTrATTTC TOmA AGCA GXATTOra. 
10221 AGTACTTXJCC ACTAGGAGAG GAGAATATCA CCTmCTGA AAATCTGGTC ATIOTAOCCA GJATmBTO 
10291 AAATtriAACA TmAGATAT CAGGGGICAT CAGGTCACAG AT ATTCTAGA ATAGAACAAT ATATAATATC 
10361 ACCCAAAACT ATnTTICTA AGGTTO.TTCT GTTAAATATG TGCTrTCTTG TITTCATNGA ATItroCAlTC 
10«Jl GTATATTITA GGTGTTAAAG TGATTmiTC TTCAATAAAT CCCGAAATTA ATTA AAAAAA AAAAAACARA 
lOSOl AGTACATTTT TGATGTGGAG AGCACTOGTA TCACTTAGTA TATAAAAAGC TTS^mTGA ATTOACmC 
10571 TTATACAAAA GTTGTGTATA TAGTTTAATT AGTnTACAT CATTTTrCCA TGTGGTGTTG CMTTCTCTG 
10S41 AAGCASGTCG TCTTICTIGG AGCTTATGCC AATACGCTAG AGAGATGAGA ATAGAATTCT GCAATCCATT 
10711 GTCAASrt?rA ATTCCATGTT ATGC AGCAGG ACAAATGCAA AAGCTGAAGG AGAGGACAGC GATTCTCGTA 
10781 CGAACGSTTA CXSATTCGACT GGCCGTOGTr TXACA 
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MDWNAILKPWETLMVPVKKHIGYUSCRQYMREMQIKMRGIJ^JATRLGVEEHVNRNISNQL^ 

RGWFEEVGKINAKVENFPSDVGSCFNIJ<VRHQVGKRASKIIEDIDSVMREHSIIIWNDHSIPLGRIDSTC 

ASTSIPSTDHHDB=^SREQTFTmiJAmPNHKSHMIALWGMGGVGKTTMMHRlJ<K^ 

EAWGEKroPIAIQSAVADYLGIEIJJEKn<PARTEKlBKWFVDNSGGKKILVimDVWQF>^ 

PLPNQGVDFKVLLTSRDKDVCTCMGAEVNSrnWKI^ETEAQSLFHQRBSDDV^^ 

CGGLPIAIKTMACTlBGKSKDAWKNALmLEHYDIENIVNGVFKMSYDNL^^ 

ILTEELVRYGWGLKLFKK?YTlGEARTRijNn'CIERUHTNLlWEVDDVRCIKMHDLVR^ 

ASIVNHShrn.EWHADNMHDSCKRI^LTCKQMSKFPTDLXFPNLSiLJ<^ 

VISYDKMKYPLlJ=SSPQCS\mLRVFHmKCSLVMFDCSCIGNLSNl£VLSFADSAIDRLPST!GKLK^ 

IJ^LTNGYGVRIDNGVUO<LVKI.EELYMTWDRGRKAISLTDDNCKEMAERSKDIYA^ 

NMSFEKLQRFQISVQRYLYGDSIKSRHSYEmLKL\n^GELL£ARMNELF^^ 

WSSSQlXQSSSFNNLRVLWSKCAEIJ<HFFrPGVANTlJ<l<LEHL£VYKCDNMEEURS^^ 

KLKFLSLCGIJ'KLSGLCDNVKIlELPQLMELELIDDlPGFreiYPMKKF^SIJJ<EE\^ 

WNIJ<EIVVPCERVIMSEEVKFREIKVSNGDKL\WIJTHKPISLLHHI^ELK^ 

GDEYNNSGVRIlKVlSGDKLVNLFPHNPMSimHLEELEVENGGSIESLPNIDLDCAGAIGQEDNSlSLRNI 

KVENLGKLRTWRIKGGDNSRPLVHGFQSVESIRmCTKFRNVFTPTTTNFNLGALLEISIDDCGENR 

GNDESEESSHEQEQIBLSEKE7LQEATDSISN\A«TSCLJyiHSFHNLQKULNRVKGVEWFEIESESPTS 

RELVTTHHNQQQPIILPNLQELlLWNMDNMSHVmCSIWNKFFTlPKQQSESPm 

U=SPLJVIAELl^NlJ<HIKIRECDGIGEWSNRDDEDEEMTTFTSnrHTTT^^ 

GAKDEGSNEISFNNTTATTAVLDQFEVCFVHIQLPI. 
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1 ACTrrmrr titocaata tccatttata tgccsatttat ttctgaaata atittatcaa aacgcaggaa 

71 AcSroiiGA ATAATACTCG TATAATTAAT TATATAAAGT TATTAGGCTG AAATCTTGAG GCTACTATAA 

lil iS^J^TC ATAAm^ 

ni aSSS ACATCATCAG ACA^ 

291 TOCGaStC TOCCAACCC CACCGACX3AC TTCGGCSTCTC CTTAATATAT CAATTATITr CTGTAAGTAT 

42X ^t^i^^r^. ggmCATTC TTGCATIAAA AAAAAAGACT gTACICTIGT CAAP^TTI TA CTTATAACCT 

m GATTAATTAA TTAAACSCCyrA ATOCATAAT TTGCATn^ 

SGI GGGTOCCGG C3AA1CAAAGC ACTTATGTAA AAGCAGGGC3A AATACAAAAA ATTTACTCGA AACAAATTIT 

631 ATICAATTTA ACaGAGATAA TAATOTrCIG ATTRCSATTAT GAGAACTACSG AGATTTAAGT GATATftTCOC 

TA^M^ AAAATOAGTG GCTATCCm CGCTITCCAA AAAGCAATTA TACTTTATACr ACACCTACAA 
m ATTITAAAAG GGGITAAACA TATCAAAATA 
??1 AGAAAAIGCT ACTAAGCTTC GAC^ 

«i ATMCATCTC ATITOSCATr CATTlCTnT TCACAATTCA TAGTrCTATT CTCAAAAAAT TCGAGTTCTC 
1051 GTATTIOTAA GGAAGATCAG AAGAGACTGT TCACACAGGT ACiViUiTiT ATTnCOTTGAT TCACATOJ^ 
1121 ATATCT-ATT GTrnCTIGC TTAAIGGTrr CGTCAGTCTA ACTGCGCITG CIGATTIAAA TnCTICACT 
1191 TTCTTCCACG GATnTTTAA ATATTAGTIT TGTGAATGAA CAATTGGTGA AGGAAAGAAA CATCGGMTC 
1261 TTTICTAAAG TAAACCTAGA TACTTAGGrT ATAAGGGTAT ATGCTAAAAT GftACTATGCC CATOACOT 
1331 T^ L - I — iUT TTTACrmT AGnTTTAGA ATCCAAGriTP TCATATOTAT CTCGAT GTGT GAGAA GAATA 
i^ni G^TTAGAA AGCTAAAQGA CGTACATAAA ATIGATEAAT TAGTCAATGT TCTITGATAT CATrATTTrr 
1I71 ACTCTCATAA AAAGCATATA GATCAAACAC AAATTGCTAC IW I TAU ' I GT AACAACTTCG ACTTAATAAT 
15U A^ScT TCATTICAAC TATITICTAA CCGAACAAGC TCACTAAAAA CICATATTGC 

S?rmT ATrroGGGrr ttacatitaa TrmrciGc atcaatgtga aaai^ctg 

I6BI CTTATTOATT cmUiWlTr CATTCAGTTG ATrTTCATTA TTACrACCTr ACAAATTGCT CAGTGATAGA 

ttocStaa TrrocTAATr cGGrrccnc taaatatgta ggagctacta aaagcaaaaa tatcgagcaa 

1B21 -TCTCGGACCC AACGGGGATT GCIGGTGCCA TTATTAACCC AATTGCTCAG ACGGCCTTGG TTCCCGTOAC 
1B91 GGACCAIXTrA GGCTACATCA TTTCCTGCAG AAAATATGTG AGGGTCATGC AGATGAAAAT GACAGACTTO 
1961 AATACCrCAA GAATCAGTGT AGAGGAACAC ATTAGCCGGA ACACAAGAAA TCATCmCAG TOCMCTCA 
2031 AACTAAGGAA TCGTIGGACC AAGTAGAAGG GATCAGAGCA AATGTGGAAA ACTTICCGAT TGATGTCATC 
2101 ACTOTTICTA GtCICAGGAT CAGGCACAAG CTIGGACAGA AAGCOTTCAA GATAACTCAG CAGATT^ 
2171 GTCTAACGAG ACAACTCTCC CTGATCAGTr GGACIGATGA TCCAGTTCYT CTA GGAAGA G TTGGTTCCAT 
2241 GAATCCATCC ACCTCTGCAT GATTAAGTGA TGATTTCCCA TCAAGAGAGA AAACmTAC ACAAGCACTA 
2311 ATAGCATICG AACCCAACCA AAAATTCCAC ATGGTAGCCT TGTGTGGGAT GGGT GGAGTG GGGAAGACTA 
23B1 GAATCA-TOCA AAGGCIGAAG AAGGCTaTTG AAGAAAAGAA ATrGTrTAAT TATATItnTG GGGCAGTTAT 
2451 AKGGGAAAAG ACGGACCCTT TTGCCATTCA AGAAGCTATA GCAGATTACC TCGGTATACA ACTCAATGAA 
2521 AAAACTAAGC CAGCAAGAGC TGATAAGCTT CGTGAATGGT TCAAAAAGAA TTCAGATGGA GGTAAGACTA 
2591 AGT-rcCTCAT AGTACITGAC GATGTTTGGC AAITAGTIGA TCTTGAAGAT ATTGGGTrAA U-j Wi-iriiA: 
2661 AAATCAAGGT GTCGACTICA AGGTCTTGTr GACATCACGA GACTCACAAG T TTGCACTAT GATOGGGfiTT 
2731 GAAGCTAATT CAATTATTAA CGTGGGCCTT CTAACTGAAG CAGAAGCICA AAGTCTCTTC CAACAACTG 
2801 TAGAAACTTC 1GAGCCXX3AG CICCAGAAGA TAGGAGAGGA TATCGTAAGG AAGTGTTGCG GTCTACCTAT 
2B71 TCCCATAAAA ACCATOGCAT ffH^TCTTAG AAATAAAAGA AAGGATGCAT GGAAGGATGC ACTTTCGCGC 
2941 ATAGAGCACT ATCACATICA CAATGTTOCG CCCAAAGTCT TTGAAACGAG CTACXaiCAAT CTCCAAGAAG 
3011 AGGAGACIAA ATCCACTTIT TrAATGTGTG G ' mU ' mLX: CGAAGACTTC GATATTCCTA CTGAGGAGTT 
3081 GATGAKTTAT GGATCGGGCT TGAAGCTATT TGATAGAGTT TATACGATTA GAGAAGCAAG AACCAGG CTC 
3151 AACACCroCA TTCAGCGACT GGnGCAGACA AATTTGrTAA TTGAAAGTGA TGATGITGGG TGTCTCAAGA 
3221 TCCATCATCT GGTCCGTCCT Tl ' I V r iTi UG GTATGnTTC TGAAGTCGAG CATGCnCTA TTGTCAACCA 
3291 TCCTAATA1G CCTCGGTGGC CTGATGAAAA TGATATGATC GTGCAOTCTr GCAAAAGAAT TTCATOAACA 
3361 iraVAGGGTA TCATTGAGAT TCCAGTAGAC CrCAAGTTTC CTAAACTAAC GATriTGAAA CITATQOOT 
3431 GAGATAAGTC GCTAAGGTIT CCTCAAGACT TTTATGAAGG AATGGAAAAG CTCCATGTTA TATCATACGA 
3501 TAAAATOAAG TACCCATIGC TTCCTnGGC ACCTCGATGC TCOtfXMCA TKXXSGTGCT TCATCTajCT 
3571 GAATCnCAT TAAAGATGTT TGATTGCTCT TCTAfTCQGAA ATCTATCGAA TCTG GAAGT G CTGAGCTITG 
3641 CAAArmCA CATTCAATGG TTACCTICCA CAGTCAGAAA TTTAAAGAAG CI AAGGT TAC T TGATC TGAG 
3711 ATTITCTCAT QCTICTCCGTA TAGAACAGGG TGTCTTGAAA AGTITTGrrCA AACTTGAAGA ATnTATA TT 
3781 GGAGAraCAT CTCGGTTTAT AGATGATAAC TGCAATGAGA TGGCAGAGCG TTCTTACAAC CTTTCTGCAT 
3B51 TAGAA'TTCC GnCTITAAT AACAAGGCTG AAGTGAAAAA TATGTCATTT GAGAATCTTG AA CGATTC AA 
3921 GATCTCAGTG GGATCCICTT TTGATGAAAA TATCAATATG AGTAGCCACT CATACGAAAA CATGTTGCAA 
3991 riUJiG ACCA ACAAAGGTCA TGTATTAGAC TCTAAACTTA ATGGGTTATT TTTGAAAACA GAGGTGCTIT 

4061 TrrrAAoror gcatcgcatc aatgatcttg aagatgttga ggtgaagtcg acacatccta ctcagtcctc 
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4131 TTCATTCTGC AATTTAAAAG TTCTTATTAT TTCAAAGTGT GTAGAGTTGA GATACCTTTT CAAACTCAAT 
4201 CTTGCAAACA CTTTCTCAAG ACTTGAGCAT CTAGAAGTTT GTGAATGTGA GAATATGGAA GAACTCATAC 
4271 ATACTGGAAT TGGGGGTICT GGAGAAGAGA CAATTACnT CCCTAAGCTG AAGTTnTAT CTTTGAGICA 
4341 ACTACCGAAG TTATCAAGTT TGTGCCATAA TGTCAACATA ATIGGGCTAC CACATCTCGT AGACTTGATA 
44U CTTAAGGGCA TTCCAGCTTT CA CAGTC ATT TATCCGCAGA ACAAGTTGCG AACATCTAGT TltnTGAAQG 
44B1 AAGGGG TAGA TATATGTTCT TTATOITAAT ACAATTTAAA TAATATTTTC AACXAAATTT TCATAATATA 
4551 TCroiAATTT GATT GTATGA TGTGTTATTG TITATATGTG GCTATTAAGG GATGATTATT TTGCAGGTrG 
4621 TG ATTC CTAA GTTGGAGACA CTTCAAATIG ATGACATGGA GAACTTAGAA GAAATATGGC CTICTGAACT 
4691 TAGTGGAQGT GAG AAAGTTA AGTTGAGAGC GATTAAAGTG AGTAGCTGTG ATAAGCTTGT GAATCTATTT 
4761 CCGCGCAATC CCATGTCICT GTTGCATCAT CTTGWUSAGC TTACAGTOGA GA A TT GO G G T TCCATIGAGT 
4831 CXnTATTCAA CATTGACITG GATTGTOTCG GTGCAATTGG AGAAGAASAC AACAAC3AGCC TCTIMGAAG 
4901 CATCAACGTG GAGAA TITAG GGAAGCXAAG AGAGGTOT G G AGGATAAAAG GTGCAGATAA CTCTG A TCIC 
4971 ATCAACGGTT TTCAAGCTGT TGAAAGCATA AAGATTGAAA AATGTAAGAG GTTTAGAAAT ATATTCACAC 
5041 CTATCACCGC CAATTTTTAT CTGGAGGCAC TITrGGAGAT TCAGATAGAA GGTTGCGGAG GAAATCAOGA 
51U ATCAGAAGAG CAGGTAACGC TTTCAATITC ACTTTCTTAA TTAATTAAGG ACTAAGCTCC TGTnTTTOA 
5181 ATAATAAAGA GGTGQ GATGA CT AAAC TTGG GCATCACAAT TGCAACAAAA TCTTACAAAC CATGAAACGT 
5251 TCAAACCATT TCTT GAATTA AGGmCAAT ACAAGTCATT TAAAAATATG GCTTAAATTT TTnTATATT 
5321 TATG TATCA A CATG ATTITT CAOTAGAGAT CATTATTATA ATAGTAAGTT TAAAGCAATT TAAATCAGAA 
5391 CTAA TICTAA CTTTAGCTAA TAAATCGTTA TAAATGTAAA TAATTACTTr TIAGTGAAAT AAGCAACGGA 
5461 TTTAATAAGT TAACAACTTA AATG TCATIT CCTAACAAAA AAAACTTTGG TTCAGAAAAA CCGCAATTCA 
5531 AGATAACIAA AATAAAAATA TTT GACATTC ACTAAGAGCA ' i ' lTnTlTItJ TAAATATGAT TGCAAATGAA 
5601 TAAA ACTTAA ATTTAT ACAG AAAAT TCTTT TATATATGTT ATACAAAATT TACAAATTGA AATTGGATAT 
5671 GTOAATIAAC GU'l 'lTAT AAT TCTQGTA'ltA CAAAGGGAXA TATAATAAAA TATTATTITC TGTAGTCATT 
5741 TSTA ATTOTA CTAGTrTATA ACCCGT0GC3A ACXaTGftGTT CTAAAATTAG TTAAACnTC ATAATAAAAA 
5811 TTTATAATTA TrATTTATIT TAAATAAATT ATTAATTAAB AGATATATCA AAAATITAAA GTTATTATAA 
5081 CTTCAAATTT AAC ATATAAT TAGAAAATAT ATGATCATAA CTTCTGCACT CTCmGIAT AAATGCAG2VG 
5951 AAGCTATTAG TATATTTCTA ATCAAGTCCA AACCTAATGA AGCCTATATA ATnTGTGAA AACTCAATTA 
6021 GCATTAGG TT TTAAGAGTCA CCAAATTCAA AGAATAATCC AAIGCTTTCA TTACCACTAT GGAGAAAATA 
6091 - ri-i'^w -ri-AGT TT AAAT GAAA TGAAAACAAA CATTCAAACT AATItriTGCT TATTAAACCA AAGACCCATT 
6161 ACTTAGCCAA GAGTTT AACA AAAAAAAATT ACATTCATGT ATCATIATTC ATGACTAGAT ATATATGAAC 
6231 ATGAAGGGAG TmTATAGA AAATATAATC ATAGATATTC AACATAACTT CAGGGAATTC CTCAAAATAA 
6301 CCAAGTTATT CAAGAAATTA CATCCAAGTC AACCAAAGAG AAGTTTAGCC TAGCATGGCT AAACTCAAGA 
^^^^ ^^EE^^^^ ^E^EEf^ GTACCAAACA TGTAGTAAGA ATCACAGTAA AAGATGATGT TGTTCTTQAT 
6441 oi-A^ ,-.;^-i-AA GTTCTICAAG TCTCCAGTTG CTCCTAATAA TGCAAAGGAG AGCCATTAAA TTCGTATGTA 
6511 TTGATCCCTT CAAAAGCTGC ACCAACCTCC CTTAAATAAC ACTCAAAGCA AAAATOACAA AATQXCTGA 
6581 AGGACC CTAT GTGG GTGCCT TGCGCQGGTG GAGCIGCATA CGAAAGGTCT ' nUCrXVmXi TGAGQGTGAT 
6651 Ui -iWviCGGG ATAGCTTGTC GCATGCTTCC GCGOGGTICA OGCACATGTG CACAGGTGAT GCATCGIGTS 
6721 TGCGTTCTTG AGTTTTGAGC CTCCGATGCT TAGTCCACTT GGCCCAATTC GACTCCAATC AGCTTATAAC 
6791 CCAi^-i'iiYi' TCAACTTATC TTCAAGTTAA GCCCAATTTG GCTTCTOCAA ATCATCCATA ACTICACAGA 
6861 ATCGCCCGTT CATCTTAATC CCGGATGCAC AATTATICTC CCGTCTTCAT TTTAAGCAAG ATACCACCTT 
6931 C TICATGCT T CATCCATCAA TAGTACACTT CATGTATCAT CTCTACTAGT TAITTAGTCC ACAAATCCTT 
7001 GTTCrrCCTCC AAATITAATT ATCTCATTTA GTTCCCCGTT CCGCTACTTT CCnAAAATT TGGAATTAAG 
7071 CTCAGAGAAA TATTAAGTAC CCGAAATGGT CATAAAATTA ACAAAAAGGA AAATGCATGA AGATTAACTA 
7141 A ATGATGA AC G AAATAT GCT AAAATAGACT ATAAAATGAA GTAAATAAAA TGAAATTATC GCACTCCGAC 
7211 CACCCXTATG GCITOTRGTC CACOCACCCT TCATTCCTTG TACCAATATG GGATGGAAAC ATCATTAATT 
7281 AAGCCAAAAA GCTAACATAT AAGGGTTTAG TGACAAAGGT AAGTACTAAA GATGAAAATA ATCCA TITIT 
7351 CTTGTTTITA CACAA CACAC ACATAGGGGC AGACGTAGGA TTTCAAAGEA CAGATTGTTG GTGGCACATA 
7421 AGTGTOCTG G TBACATriT TrmTCTTT TTACGTGG TC GCACAACAGT AGGAAAAACG AAAAATTOGA 
7491 AATmTTAC AATITOICTT AAAAAAAACA GGGGTTGTIG GTOOCACTAT GGACAACAAA GTTSAACTGC 
7561 CCTACGCGCG CACACA CACA CACACACATA GAGAGAGAGA GAGAGAGAGA GAGAGAGAGA AAGAAAGAAA 
7631 GA GAGAG AGA GTITGGGATG TGA TACTICT TTTAGGAAAA TGGAGTTATA TCTTTGATAT TGTATnnT 
7701 TAAT GTAATT TAT OTAT TTA ATCATTTTAG TTTATAAGTr NTATTTATm GGNTATGAAA AAAAAAGTCT 
7771 TnATACATT GGATTTAACA TAAAAATCCA ACAATATTAA TCAAAAAGAC CAAACATGTG GACAATTATC 
7841 TATATAATTA ATTCACAATA GTCTTTAGGA ATAGTATTAT ATATATAATT AATICTCAAT GGTCTTAGGA 
7911 ATAGTAAGTT CTTATATITC AAACTTITGC CACAATTCTT TGCTTACTTr GACACTTTTC CTTCCT/^ACT 
7981 TTACATATAT ATATAT ATTA AAGC GCAAA G GTCATAGGAA TATAATATTT TCTATTATTC TACGTITTGC 
8051 CACAA AAGTT TGA ACACT TT GCGACTITIT GTCCCTCCTT AACCnTTCA ATGTITIGCG ACAAAAGTTC 
8121 CAAAACTTTG CCACX TilAT CATECCTCAA CTITTCACCG CATTAGnTG TGGAGITQGC AGTriTCCnC 
8191 CCTCTUVCTT CGATATTCTC TACTQCTAGC CAAAAAGGGT TCCAGAGTIT CACACinTG GTCCCTGACA 
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826X GTAACCAAAT GTGAGATCIC AAAOTTTrGC CACATTAGTT TGTGGAGTTG 'IUXTiTiU3 TCCCCCCACA 
833X TTCGATATIC TACTATCAOGA TCTrA TmT CTCAAATAAC AACACGIATA TTTCATC iCT AATTGGAAAA 
8401 AGAGmTAA M:AAATAAC GACTAGG: : : G;GC:GAGTr TTTrrPiACA ACaTXTCTATC AAA TCAT^^T C 
8471 AAAATTTAAG GIGGAACGGT GACCACATTA ACCAGAAATG TAATTTATTC TTTGATnTG ATAATnTTA 
8541 ATATnTGTT GTGATCTATG TATTTAAAAG TAAACAACAA AGAACATAAT CCAAAACCCT AAATTGCAAG 
8611 ICTCGCCCAA TTTCTCTATC ACTAGTCCTC ACTTACGATG GCGTTACXTrC GCTCTCTCAC TGCTIACAAC 
8681 CCnTCTTGC TACTCATTAC AATAACGAAA AGTTGAATAT CCATATATTT ATTTGGATGT GGAA1TGAAC 
8751 GAATCICGTC AAAATTTTGA TrTTGTTGAT GGATTTGAGT AGAAGTTTGG GCAGAACGGG AATGA TGGTC 
8821 TGCAAGTGGT TATAAACTTG ATTCTGAGTT ATT ACTA TAT ATGTAGCCTC TTTAC AACGA CCAAGGTTTC 
8891 TTCCAGGTAC CATTTGATCT TnTAGAACT TACmTCTG AAACACCCTG ATTTCGATCA AATATCA CCA 
8961 ACAACTCTTA AAAACTTGAT TAATCAATTG TTITCTTCAT CTTGATAACA AGTGGAATGA TmCTACTr 
9031 AGATTAACTT GAAAAAAAAG GTCXZATGTGC GTCTO GTOG A TCTGGTAAAT G AAGATG GAA GG GAGAGCTG 
9101 ACnTAAAGA CACAAACACG TCACCATATC TCTIAYm'A TnTAAATIT GCmTQGTG TATTTTCm 
9171 TTTCCTATTT ClTlVmLT TGATCTCCAG ATGGTA TGTG GT GTGG ATAA TITA CACCT A GAGATTOGGA 
9241 ACGATGGGAA GGGGICTGIG ATTTATGGCT GGCCGAGTTT TACTTATTAA CTTCAATTTCA ACCTAAATTC 
9311 TGATTCTTGT TTGAAAATAA GTIGCATCrr TATITITGTA TTATCTTGTT GC ATAGG ATC C TTAGCA TCT 
9381 TTTAATAATT TATTTGAAGG TGAAAGATCC AACTATmT TAGCTGTTGG CATnTCCAT CATTTGCAAC 
9451 TGTrrCrTGA AAAAAAAATA CCTAAAATAA AAATAACCAT TTTCAAATCC AAAATTATAA GAGAGAATTG 
9521 TAAATGGACA TC3GAATCATA AATCATTAAC ACAGTTCAGT AAACAAGTTG CT AATTA CAT TTCTTGCTGT 
9591 GCAGATTGAA ATTCTATCAG AGAAAGAGAC ATTACAAGAA GCCACTGGCA GTATTTCAAA TCTrGTATTC 
9661 CCATCCT3TC TCATGCACTC TTTTCATAAC CTCCGTGTGC TTACATTGGA TAATTATGAA GGAGTGGAQG 
9731 TGGTArrrGA GATAGAGAGT GAGAGTCCAA CATGTAGAGA ATTGGTAACA ACTCGCAATA ACCAACAA CA 
9801 GCCTATTATA CITCCCTACC TCCAGGATTT GTATCTAAQG AATATGGACA ACACGAGTCA TG TGTCG AAG 
9871 TGCAGCAACT GGAATAAATT CTTCACTCTT CCAAAACAAC AATCAGAATC CCCATICCAC AAOCICACA A 
9941 CCATAAATAT TCTTAAATGC AAAAGCATTA AGTACITGXT TTCPCC TCTC ATGGCAGAAC TTCTTTCCAA 
lOOll CCTAAAGGAT ATOCGGATAA GTGAGnGTGA TCGIAITAAA GAAGTTGnT CAAACAGAGA TCATGAG GAT 
10081 GAAGAAATGA CTACATTTAC ATCTACCCAC ACAACCACCA CTTTOTTCCC TAGTCTTGAT TCTCTCACTC 
10151 TAAGTrrCCT GGAGAATCTG AAGTGTATTG GTGGAAGTCG TGCCAAGGAT G AGGGGAG CA ATGAAATATC 
10221 TTTCAATAAT ACCACTGCAA CTACTGCTGT TCTIGATCAA TTTGAAGTAT GCnTGTACA TATTCCATTA 
10291 TTTATTTAAT TTCCTTTTTT ATTTGCAATA TTCTATAAAT AATACATTTT ATACCCACTA TACTAAGATA 
10361 ATAATTACCT AGAGGGATGG ATGCTATGAC ACAGCTGCTA CACTTCAGAA ACTCT ARTAA GGGC AGTTAT 
10431 GGAAGTTCAA TAAAATGATA ATGGCATCTT TTGATGGGTA ATATAGGCAA TTTAAGnTr ATTTCTCTTA 
10501 AAGCAGTATT TAGCAAGTAC TC3GCCAGTAG GAGAGGAGAA TATCACCTTT TGTGAAAATC TGGTCATTGT 
10571 ACCCAGAATT TAGTTAAATG TAACATriTA GATATTAGGG G TIATCAGG T GA CAGATATT GTAGAATAGA 
10641 ACAATATGTA ATATTACCCA AAACTATTTT TTCTAAQGTT GCTCICTTAA ATATGTGCTT TCOTG^mTC 
10711 ATTGAATTTG CATTCCTATA TITTAGGTGG TAAAGTGATT GTCTdTCAA TAAATCCCGA AATTmTAA 
10781 TTAAAAAAAA AAAAAACAAA AGTAAATTTT TGATATOGAG A GCAC IGgTA TCATT TAGTA TATAAAAAA C 
10851 AGATTTTGAA TEAAGTITCT TATRTAAAAG CTGTGTATAT AGTrTAATTA GTriTACATC ATTmCCAT 
10921 GTGGTGTTGC AGTICTCT G A AGCAGGTGCT GTPTCTTGGA GCTTATGCCA ATACXXTTAGA GAGATAAAAA 
10991 TAGGCAACTG CCATGCATTG TCAAGTGTGA TTCCATCTTA TGCAGCAGTA CAAATGCAGA AAQCTT 
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MSDPTGIAGu^llNPIAQTALVPWDHVGYMISCRKWRVMQMKMTELT^rSRISVEEHISRr^ 

SarKEVVUDQVEGIRANVBJFP!DVIT(X:smiRHKLGQKAFKITEQIESLTRQLSUS^^ 

SMNASTSASl^DDFPSREKrFrQALIAL£PNQKFHMVALCGMGGVGKTRMMQRLKKA?EEKK^^ 

GAVI?EICmPFAlQEAIADYLGIQLNEKrKPARADKmEWFXKNSDGGKTKmvm^ 

SPFPNCKaVDFKVLLTSRDSQVCTIVMSVEANSUNVG^ 

CGIJ'IAIKTMAC?lJ^NKRKDAW<DAI^RIEHYDIHNVAPKVFETSYHNLQEEEmSTFI^ 

PTEELMRYGWGU<LPDRVYTIREARTRlJ^CIERLVQTNmESDDVGCVKM 

ASIVNHGNMPGVVPDENDMIVHSCKRISLTCKGM1BPVDU<FPKLTILKLMHQDI«1^^ 

HVISYDKMKYPLiPUPRCSTNlRVLHLTECSLKMFDCSSIGNL5NLE\n-SFANSHIEWlPSWRI^ 

RLiJDlJ=lFCDGLJ=^lECK3VlJ<SR/KLEEFYlGDASGHDDNCNE^MERSY^ 

Nl^FWSVGCSFDENlNMSSHSYBslMLQLVTNKGDVUDSKLMaLJ^^ 

THPTQSSSFCNtJ<\aJISKCVElJlYlJ=KmLArm5Rl^l£VCECENM 

LSLSQlPKLSSLCHNVNIIGIJ»HLVDLllJ<GIPGFrVIYPQNKLRTSSLLKEGWIPKLETLQIDDMENLEE 
IWPCELSGGEKVKIJVilKVSSCDKLVNLFPRNPMSLjmHL£ELT\/ENCGSIES!^IDm 
KSI±RSINVENLGKmEVWRIKGADNSDLINGFQAVESIKIEKCKRFRNIFTPlTANPrL£AlJ£IQIEQ^ 
GNHESEEQVTLSISLS 
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BD33 (r^ RD33) 

1 AATGGCAAAA GAAGTCGGAG CAAGAGCTAA GTTAGAGCAT CTATTTGACG TCATTATCAT GGTAGATGTC 

71 ACTCAAGCAC CCAACAAGAA CACAATTCAA AGTAGTATTT CAGAACAGTT GGGATTAAAA CTGCAAGAAG 

141 AGAGCITGTT GGTAAGAGCA GCTAGGGTAA GTGCGAGGTT AAAAATGCTT ACAAGGGTGC TGGTGATATT 

211 AGACGATATA TC3GTCAAGGC TTGACATGGA GGAACTTGGG ATTCCCTITG GATCAGATAG ACAAC ACCAC 

281 GGCTGCAAAA TCTTGTTGAC TTCAAGAAGT ATTAGTGCTT GTAACCAGAT GAGAGCIGAT AGAATCTTTA 

351 AAATACGAGA AATGCCACXG AATGAAGCAT GGCTTCTTTT CGAAAGAACA GCTAAAAAAG CTCCGAATC7 

421 GCATCAAGTA GCAAGAGATA TCGTGGAOGA GTGTGGTGGG C 
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1 GAATTCGCTG TTGGTAAGAC AACTCTTGCC TCTTCTGTIT ATGATGAAAT CTCTAOCAAC TITCATOGTr 

71 GCTOCITTCT AAAAATATCT GGGAGGAATC AAGTAATAAA GACGC?rATAG AAAGMTOCJl AQAAAAAATC 

141 ATrTCTCATG TTTTGAAACA AGAGCAACTO GGCGTAGGGA GAGTTGAAGA AGGAAAGCGC ATOATAAAQC 

211 ATAOCf l TACA ACAXAGAAAG GTATTGATTG TGCTTGATGA TGTCGACAAC CTTGACCAGC TAGCIaISaAC 

2B1 AGTTQQCroG ATCACATCAT TGGTrTGGrrC AAGGTACCCG CATAATAAIC ACAACTAGAG AlXSAACATtST 

351 ATTAAT TGCA CACAAAGTAG ATCTGATACA CAATATAAGC TTGTTAAACA ACGATCAAGC TATCCATOC 

421 TTCTGCAAGC AAGCACCACG GGGTCACAAA CGTATACAAG ATTATCAGCA ACTTTTAAAA CATOTCCTTP 

491 CTTATQCTGG TGGGCTTCCA CTAGCACTGT CGAC 
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S^'^-Z^O NO'. 3/p 

RLG1-E169 f ^ 

[Stzani] 



1 ATCCT A ACC3 TTCGTACCSAG ANCGCTGTCC CTCCTTOITC •mTGTCATA T GTCATM TC TCATONAT n^ 
71 TGCCACAITTT AATTTPGTGG TTATTITAAA TTAATTTTrA ITCCACATGT GATTTTATGA GTnTTCTAT 
X41 TrTATTGAGT TTCACATAAT ATTrAAATCT AATAACAATA AATGCATATT TATTTnCTT TAAATAAlkCG 
211 CATATAATAT ATAGATTAAA ATCATATAAT ACMACXSTIA AACTCATATA ATACA TATGT TCAT COOCAG 
281 TTTATTTATA IXSTCTC A TCC TTAATTTATT TATTATTTAT TTATTAGAGT AGATGATCIT TCTGATATTA 
351 AAAATTTAAT TTGTTCAAAA TTTAAAATTA TTAATAATCC CACAATTTGA ATAAAATTAA AAA^AATQGN 
421 CCCACCATTA GTCCATCACT TTTICAGCIC ATCAATATCG TXaGTATICT CCTTCGTTIC CACCCTAATC 
491 AATATTTCCA GCGAATGACA GACTCCI A CG GCGmCTBA ATTIGCGTTC CGftCACtGrT CAlTCWtfSGA 
561 GATAATAAAT CAAATGGAGC TGCICCAATG TTCATTCCIG ATGAAAGGTG AATTGTATGT GAAGANAATO 
631 TCAGCGATCTJ ATCTCCATCC GGAACCCACC ACATTATCAG TGTACCACCA AACCACICAA AACGGYGGAA 
701 CSTAGRRAKAC WRKAAAGTCA TGAAGAATAG ATTATlTnU TCCTCATSGG CTGACTGAGG ACXXXKTnTA 
T71 GTTCATCATT TTlCnTCAN CAAAGAATTA TCGGTC CATC GAATITTT AC ATCGACA AAS AAG TITCACT 
B41 ITXCAATCr? ^RGTrAAACA ATTmAATC TrTTTAlCTT 1TCQTTC2AAA CTCCXCAATT OCAACnoCA 
SIX ACITGCAACT TXTGGGCCCA CAAATTIGTQ GTGGGOGTTA ATTTAATCCA CATATTCACT GTAAACAATA 
9B1 ATXCAAATCG ATCTCTGTTC ATCCAATTCA TCAACATCTC TTGATAATTG AAATCATICA CGC TTCATOC 
1051 ATTTCATCCA CATCIATACT ATATTCTCTG CTCTTAICAT ATTAAAOGAT GGCTGAAATC GTICmcro 
U21 CCnCTTGAC A O ' lUfiUTlT GAAAAGCTGG CATYTGAAGC CTXGAAGAAG ATTSTICGCT CCAAAAQAAT 
1191 TQAATCTGAS CTXAAGAAAT 7GAAGGAGAC ATZACSftCCAA ATOCAAGATC TCCITAACXSA TGCrZGCCAC 
13S1 AAOGAAGTAA CTAATGAAGC CGTTAAAABA TCOCriGAATG ATCTCCAACA TXTGGCnAT GACATAGACG 
1331 ACCTACnCA TGATYTTGCA ACTCAAGCTG TICAWCX3TGA GTTGACCXSAG GAGGCTIGGAG CCTCCTCCAG 
1401 TATGGTAAGA AAACTAATCC CAAGTTGTTG CACAAC?nTC TCACAAAGTA ATAGGATGCA TCCCAAGTTA 
1471 GATGATATTG CCACCAGGTr ACAAGAACTG GTAGAQQCAA AAAATAATCT TGGTITAACT GTGATAACAT 
1541 ATSAAAAGCC AAAAATIGAA AGGTATOAGG CGTCTTTGGT AGATGAAAGC GCTACTGTCX3 GACGTGAAGA 
1611 TGATAAGAAA AAATTGCTGG AGAAG CIVIT GGQGGATAAA GATCAATCAG GGAGTCAAAA CTTCAGOOC 
1681 GTQCCCATAS T TG C T A TGGG TGGAGTIOCT AAAACAACTC TAGCXAGACT TTTGTATGAT GAAAABAAAG 
1751 TGAAGGATCA CTTOGAACIC AOQQCTXOGC mU ' mmi: TQKXGAGTTC AGTCnCOCA ATATAAGCM3 
1621 AGTTATTTAT CAATCTCTCA CIGGOQAAAA GAAGGAGTXT GAA GACTIAA ATCTGCTTCA AG AA UCIV IT 
1B91 AAAGAGAAAC TTAGGAACCA GCTATPTCTA ATAGTnTGG ATGATCTGTC GTCTGAAAGC TATGGTGATT 
1561 GGGAGAAATT AGTGGGCCCA TTCCTTGCGG GGTCTCCTGG AAGTAGAATA ATCATGACAA CTCGGAAGGA 
2031 GCAATTGCTC AGAAAGCTGQ GCTTTTCTCA TCAABACCCT CTGGAGGGTC TATCACAAGA TGATGCTITG 

2101 ai;rr:uT: ' : t j ctcaacacgc ATrrGGiGTA ccaaacttxq attcacatcc aacactaaog ccagatggaq 

2171 AACTCnTGT GAAGAAATCT GATOGCITAC CTCTAQCrrT AAGAACACTT GGAAGOTTAT TAMJGACAAA 
22(1 AACAGACSA3 GAAGAATGGA AGGAGCTGTr OGATAGTGAG ATATSGAGCTT TAGGAAAGAG CGATBAGATT 
2311 GTTCCGGCTZ TTAGACTAAG CIACAATGAT CTTTCTGCCW CTTTGAAGCT RTTR'nTGCA TAYTGCTCCT 
23 Bl TGTITCCCiiJV GGACTATGAG TTTGACAAGG AGGAOTTGAT TCTATTGTGG ATGGCAGAAG GCTmTOCA 
2451 CCAACCAACT AYAAACAAGT CAAAGCAACG mGOGTCTT GAATATTm AAGACnTOTT GTCAAGRTCR 
2521 TrrWVCfi^JZ ATGCTCCTAA TRFCAAATCS TTGTnGTSA IGCATGACCT AATGAATBAT TTGGCTACAT 
2591 TTGTTGCTGG AGAATTmT TCAAGGTIAG ACATAGAGAT GAAGAAGGAA TTTAGGATGS AA'IVITIUUA 
3661 RAAGCACCSI CATATGTCAT TIGTATGrTGA GRATTACATA GGTIACAAAA RGTTC3GAGCC ATITAGAOQA 
2731 GCTAAAAATT TCAGAACATT TTTAGCATTG TCTGrTOGGG TDGTAGAAGA TTQGAAGATG TTTIACTTAT 
2801 CAAACAAGGT CTTSAATGAC WTACTTCARG ATnAOCATT GTTAAGGCTC CTRAK'ITIUA TTRRTCITAY 
2871 AATAASYKAG GTACXIARAAK TCGTSGGTAG TATGAASCAC TTGCGGTATC TEAATCtATC WGRAACTTHA 
2941 ATCAQCATT TACCGG AAWA ORTCTGCAAT CmATAATT TACARA CCCT GA TlU TK ' iVr G GCTGTCAOT 
3011 ATZTAGTTAA KTIOCCCAAR ACmCTCAA ASCTTAAAAA TTTGCASCAT TTTBACATGA GQSRTACTOC 
3081 XAA2CITRAAR AACATOCCCT 1AB0GATTGG TOAI CTTGAAA ARTCIACAAA CTCTCmMQ TAACATIGGC 
3151 ATAGCAATAA CCGAGCTPAA GAACTTCCAM AAYCTCCATG GGAA AiriTiU TATIGGCGGG CTGGGAAAAA 
3221 TGGAAAATGC ilGTlXGATGC ACGTTAACCG AACTTGTCTC A:AAAAAGGT TWAATGARTT ANAAACTOGR 
3291 imGGGGGTTG AHIAATITAA TGTnTCCGA AATOGGAACA CZTTQAAAAAA NAAGGTCCTC AATGAATTGA 
3361 ATGCCTCA::A ATQCrAYXCy AAHMAARRRY YWrCARMMAT TWNBKAWRRK GKOmXIRR "ncnHyRAAW 
3431 WAGRGTKI^ KAFGTAGGTT TCATCCAA7C ACCCAAC?IGG GAAAATAGAT GATKITITCA GOQCmCIG 
3501 ATGAGATGTS GAGAOGTATG ATAGGGTOTC TIGGGGCGGT AGAAGAAATA AGCATDCATT CTTOTAATCA 
3571 AATAAGATAT yiGTGGGAAT CACAAGCAGA GGCAACTTAAG GTrCTTATGA ATTTAAAGAA GTrGGATITA 
3641 GGTCAATCTG AAAATITGGT GACTTITAGGG GAGAAAAAGG AGGATAATCA TAATATTAAT AGTCGGAGCA 
3711 GCCTAACATT TTTTAGGAGG TTGAATGTAT GGAGATGTTAA CAGCTIGGAG CATTGCAGGT GTCCAGATAG 
3781 CATGGAGA?vT TTCTTATATGC ACATCTGTGA TTCAATNACA TCCGTCTCCT TCCCAACAGG AGGAGGACAG 
3851 AAGATCAAC7 CACITACCAT CACIGATTGC AAGAAGCTTT CGGAAGAGGA GTIGGGAGGA CX»GAGAGGA 
3921 CAAGAGTGC7 TATAAACTCA AAAAT GCAGA TGCTIGAATC AGTAGATATA CgTA ATIGGC CAAATCTBAA 
3991 ATCTATCAS7 GAATTGAGTT GCXTCATZCA CCTSAACAGA TTATATATAT CAAACTGTCC GAGTRTOQAC 
4061 TCATTTCCTG ACCATGAGTT GCCAAATCTC ACXnCCTTAA CAGATCCAAG GAGAGQACAG CGATTITCGT 
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Further Characterization of RG2 Family Members: 

Further sequencing of cloned RG2 polynucleotide sequences, as discussed 
above, identified additional RG2 species, listed below. Additionally, further sequencing of 
the 5* sections of RG2 sequences listed above resulted in modified and/or new sequence 

5 information, also listed below. The AC15 sequences found in the 3' sections of RG2 
family have not changed. 

Listed below are: four full length species, RG2A, RG2B, RG2C and RG2S; 
two near complete, but with a gap in the largest intron, RG2D and RG2J; three nearly 
complete RG2 gene sequences, RG2K, RG2N, and RG20. The deduced translation 

10 products (polypeptides) encoded by these RG2 species are listed below. The 

• polynucleotide sequences do not contain any gaps (as with some of the polynucleotide 
sequences), because all of the gaps in the sequences are in introns, i.e., there are no gaps 
in exon, or coding, sequences. 

They include: an RG2A polynucleotide sequence (SEQ ID NO: 87) and its 

15 deduced polypeptide sequence (SEQ ID NO:88); an RG2B polynucleotide sequence (SEQ 
ID NO:89) and its deduced polypeptide sequence (SEQ ID NO:90); an RG2C 
polynucleotide sequence (SEQ ID N0:91) and its deduced polypeptide sequence (SEQ ID 
NO:92); an RG2D polynucleotide sequence (SEQ ID NO:93) and (SEQ ID NO:94), and its 
deduced polypeptide sequence (SEQ ID NO:95); an RG2E polynucleotide sequence (SEQ 

20 ID NO:96) and its deduced polypeptide sequence (SEQ ID NO:97); an RG2F 

polynucleotide sequence (SEQ ID NO:98) and its deduced polypeptide sequence (SEQ ID 
NO:99); an RG2G polynucleotide sequence (SEQ ID NO:100) and its deduced polypeptide 
sequence (SEQ ID NO:101); an RG2H polynucleotide sequence (SEQ ID NO:102) and its 
deduced polypeptide sequence (SEQ ID NO: 103); an RG21 polynucleotide sequence (SEQ 

25 ID NO: 104) and its deduced polypeptide sequence (SEQ ID NO: 105); an RG2J 

polynucleotide sequence (SEQ ID NO: 106) and (SEQ ID NO: 107), and its deduced 
polypeptide sequence (SEQ ID NO: 108); an RG2K polynucleotide sequence (SEQ ID 
NO: 109) and (SEQ ID NO:110), and its deduced polypeptide sequence (SEQ ID N0:111); 
an RG2L polynucleotide sequence (SEQ ID NO: 112) and its deduced polypeptide sequence 

30 (SEQ ID N0:113); an RG2M polynucleotide sequence (SEQ ID N0:114) and its deduced 
polypeptide sequence (SEQ ID NO: 115); an RG2N polynucleotide sequence (SEQ ID 
NO: 116) and its deduced polypeptide sequence (SEQ ID NO: 117); an RG20 
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polynucleotide sequence (SEQ ID NO: 118) and its deduced polypeptide sequence (SEQ ID - 
NO: 119); an RG2P polynucleotide sequence (SEQ ID NO: 120) and its deduced 
polypeptide sequence (SEQ ID NO: 121); an RG2Q polynucleotide sequence (SEQ ID 
NO: 122) and its deduced polypeptide sequence (SEQ ID NO:123); RG2S polynucleotide 

5 sequence (SEQ ID NO: 124) and its deduced polypeptide sequence (SEQ ID NO: 125); an 
RG2T polynucleotide sequence (SEQ ID NO: 126) and its deduced polypeptide sequence 
(SEQ ID NO: 127); an RG2U polynucleotide sequence (SEQ ID NO: 128) and its deduced 
polypeptide sequence (SEQ ID NO: 129); and RG2V polynucleotide sequence (SEQ ID 
NO: 130) and its deduced polypeptide sequence (SEQ ID N0:131); and, an RG2W 

10 polynucleotide sequence (SEQ ID NO: 132) and its deduced polypeptide sequence (SEQ ID 
NO: 133). 



Characterization of New RG Family Groups and RG Species: 

Further BAC insert characterization and sequencing, as discussed above, 
15 identified new RG polynucleotide sequences. The new sequences were characterized as 
belonging to new RG families; designated RG5 and RG7. These RG polynucleotides 
sequences, and their predicted translation products (the polypeptides which are encoded by 
these sequences) are summarized and listed below. 

Identified and listed below is an RG5 family member, designated as the RG5 
20 polynucleotide sequence set forth in SEQ ID NO: 134, and its deduced polypeptide 
sequence (SEQ ID NO: 135). This sequence contains an NBS region sequence. 

Also identified and listed below is an RG7 family member, designated as the 
RG7 polynucleotide sequence set forth in SEQ ID NO: 136. No deduced polypeptide 
sequence is given for the new RG7 family member as this sequence appears to be a 
25 pseudogene. 



RG2A polynucleotide sequence (SEQ ID NO:87) 

AAAGTTCATATCCAAGCTTGCCCTCCAACTCTAGCTCCTTCAATGGCACC 
TCCTTCTCTTCAAAAGCACACAAGAACACTTTCAAGCTCAACCACACTCA 
30 CAC.AAGCTCTAGAACGAGGGTTAGGGCACATTTAGGGTTTTGCTCTCTGG 
AAATGGTGTCTAAAAGTGAGGCCATAATGTTCCTTATATAAGGCTCACTC 
CCACAATTAGGCTTTCAATCTGAACGTANTACGCCCAGTGTACACTATGG 
TACGCCCAACGTACTCGGTAGTCTCCGCGTCAANAATACACTCATGAGTA 
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CGCGCAACGTACTTTCCCTTACGCCCAGCGTACTCAAAAGCCAAACATTC 

TnTCAAGGACTAATTTTGACAACTTGAGGAAAGAAAAGGATCAAAGANA 

TATACTTGAATTCCGGGATGTTACAATGAAGTTGANACCTTGGCTAAAAA 

ATTAAATTGGITGTGGAAGCCGTTGGCTGAGCAAGCAACAAGGGTAAAAT 

TCGTAATCTACAAATGGTGTrATTTrCTATTTCTTCTTATTATTTTACTT 

GATTTACGGGTAGTTTTTTTTTCTTACAAAAAATATTAAAGTTGATAAAG 

TATAGCCACTAAAATTGACTTTTTCCAAAACATAATGTCAAATGGTGCGT 

ATATGTATCATGTTGTATTANATAATGAATATGATGATNCTGTTCTATTT 

AANCCGAAAAAATTATCTAATGATTTTATATTGGAAAACAAAGTTGTGAT 

mTNGCATAATATAATCAAATCCNCTTTTGTNTGGGAGGTGGATAAATG 

TGGTAAATITANAACAAGTGTTTTNACNTTGAAGGGTNTGGAAAGGTTGA 

AAAAAGTTAAAATGATAAAATGTTTACACAAATGTTGTATCCGACTGAAT 

ATNATGTTTAAGGATNATTGTATTAAATTGTTGATATATAGTAAGCATAA 

ATATTTAGAATTGTGACTTAAATTTATAAGTTATNCNAACTGGATTGAAA 

CATTTTTGATATANATTAGGAATGAAAATGAGCAACCCTAACATACTTAT 

CTTTGGTAGTTTGGTTATTATATTTTTATTANAATATAGAANCATCCCTT 

TATTTTAAACCCATATTGTGGACGGACTTGAATAAATGGGAAAAATGTAC 

CTTGCTATTTAGCACAAAAAAATTATAAAAATGTACATTGCTATTTAGCA 

CAAACAAAAAAAAAAAACTTATCCTTTTTGCATTAGGTCACAAAGAAATA 

TAA.\ATGGGAAATGTGTTGCTATTTAATGCACTAAAAGAAACTATnTGC 

CTTTATTAAACCGGGTAAACCAATAGAAAAATGGAAGTACATTGTCATTT 

AGCATGAAAAAAAATAACTTTCCATTTTTTGCATCCGGTCACAATAATAG 

AAAAATGAAAGTACGTTGCTATTTAGCGAAACTAACTTCCTTTTTTCTTT 

TTGGCATCGTATCATAAAATATAGACTAAAATACGTTAGTTTTACATTTT 

TAATACATTGAAATGTCTAATCCACATGTTATTCTATAAAAAGGGAAATG 

TAATTTACTTATT(nTrGATTCTTTGGCTTCTTTTTAGTACCCAAAACAT 

CCCTCTATCCATCTATTCCAACTAAAATAATGAAAACTATATTCCTTCCA 

TTGTAGGGATGTTATAAATTTTGTAATTGTTTTTATGCAAAAAAGTGTTT 

TTTGTTAACTAGATTAACGAGATTCATTTTTCAGCATTTTAGGAGAAGTT 

CATCCATCTTTTGGATATGAAGTGCAAGCCAAGTTCTTTAACATGGAATA 

TGAGGTCCCTATATGCTCAAAAAATAGCAAATGAGAAATTTTTTAAATTG 

GATCCCCATAAAAGAAAATTTGTTAATGGTTGTTTTAATATTGGTCAATG 

tgtccaccggatgagcataatactagtttataaggggtaaaggtgggttt 
ggtgggcccatttatctttattatttctaaaagtcagaattaagtaaaaa 
aaattataagataaataccataaggataaaaaatcattttatttggacca 

AAGACCAAAGTTGTTAAGGGGCTGTTTGTTTTTTTTGTGAAGAGCTGTGC 

aaccacttttgtctgcgccgcacagacaacgtgcagacatatgccctcgc 

AGAGTGTTTGTTTTTTGAAAGTGCGCAGACCAAAAAAACGTCTGCGCGAG 

gtcatcctggcgcatatatgtgtcactgtcttcaaaggtcttcagacctc 

ATTTTAACCAAAAAAAAAAAAGACCACCGGTTTTTTTTTTTTITTTNTTC 

tttctcttgtagctgaaaatgcatttttaatctttatgacatgaaattaa 

gtttgaaaaattaatttatttcaacagctgtagacgttaaaaacaaacag 

tcttcttgttgcagactgtggacatttggtccacctcttctaccgcagag 
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ACTTGCAGATGTGGTCCGCAGACTGCAGACATTTTGGCTTCAAATAAACA 
AACATCACCTAATTTGACTACACCACACGGACCTCCAATGTAACAAAAAA 
AAGGTTGAAACAAAGTTGCCTATTTCTCCATATCCAGGGGCCATTTATGT 
AAGAGTTATCTAAATTTTAGTTCGGTAGATCAGTTCTCACATTTTAACCG 
5 GGT.AAAGTGTATGTGTGTACGCGCGCACCTGAAAGGTTTGAANGTAACTT 
CCA.AACTGAANCAANAATCGATATGAAGTATCAAGTTAGAGGTTCAATTG 
GTGAAGGAATCAGCTGGAGGTTGGGGAATCGAGCTTCCACTATTAAGGTA 
AAATCCATAACCCTAAATGTTGGTACGCTCATATATCAAATTGCGTGTTT 
TGTTGAATGAAAAAAGCATGCTCAAAAAACCAGTGTAAGGCACGGTATAT 

10 GACATATTTATAGTTACTGATAACAAATTATGATAATTTTGGGTTTACGT 
AAGTTAGGATTCGTACTTCAACCAAATGTAATAGTTTTTGTGAGTCTATC 
TATGTATTTGGGGAATCACATTAGCAACGGGATTGTACTAGTAATTCGAA 
AAAGTCTTTTAAATAATTTITCTGTTTATAATTTATGAATAGTTTTAGCG 
ACATCTAATATTAAATAGAATGTATCTGATATTGAATTAATGTCCTTAAT 

15 GTG.AACATAGACCTTTTCCATTTACTAATGCCTAATTATTAGTTTCTAAT 
CAATAAATTTTAATTTCTGTTTTATGCTTCTAAGACAATAAAAATCCATG 
ATTTACCTTTAAATATTAACAAAAATGACCATAAATAAATAAAAAATTAG 
GATACCAAACCCCCCCGCCATGCCCAATGTCTAAATATTCTTGATGCTTT 
TGCTTTTCCCTCTTTTCCTTGTTAGTCTATTATTCTGGAGAGTTTGAGAG 

20 AGTTTCATACAAGAAAATTTCAAGAAGAAAGCAAAGGTCCAGGTATTCTC 
TTTTCTTAATTATGTATTAACTTACAAGCATTTTTTACACGATCCATGGT 
TTTTTGTGTATGTTTTTCAAATTGAAACTAGATTGGGACTTTTGCCCrTG 
ATGATTCATAAGATATTGCATGGAGTTGAGATTGTGTAAGAAAAGTGGTG 
AATAGAAAGAGCAAGTGAATCCAGATATAGTATTGGTAATATATGATGAT 

25 GAGATAGAGATATGTTAAAACTGGCTAGAAAATTGTTTTAATTTGAAATT 
TAGGTTGTTGAATTTGAAAGATACCAAGCTAATAACTAATTAGTTATGCT 
AAATAGTTATAAAGAACAACAAACTCGTAGTTITITmCATGATTTTCA 
ACCTCTTCGTACCAAACTAAATTATAACAAAATTGAATATCATTCTCTGC 
AATCAATTTTAACTTTTGTTATTATCATCATGTCTAAAATTGCCACAAGT 

30 TTATTTTCATAGTCATATTGGATTATGAAAGGACTATTTTTACCAATTAC 
ATCTTTACTTTATGGCCAAAGCTAATACAATCCGACTAAACTAAAGGATT 
CTAGGATGCATATAGTTTGCTCCCCGATTATAGATTTCTATCTAATTTGT 
CTATTGTACTAATTTAGGTGCCACCACAAGTAAATTCCTGAAATGGATGT 
CGTTAATGCCATTCTTAAACCAGTTGTCGAGACTCTCATGGTACCCGTTA 

35 AGA.AACACATAGGGTACCTCATTTCCTGCAGGCAATATATGAGGGAAATG 
GGTATCAAAATGAGGGGATTGAATGCTACAAGACTTGGTGTCGAAGAGCA 
CGTGAACCGGAACATAAGCAACCAGCTTGAGGTTCCAGCCCAAGTCAGGG 
GTTGGTTTGAAGAAGTAGGAAAGATCAATGCAAAAGTGGAAAATTTCCCT 
AGCGATGTTGGCAGTTGTTTCAATCTTAAGGTTAGACACGGGGTCGGAAA 

40 GAGAGCCTCCAAGATAATTGAGGACATCGACAGTGTCATGAGAGAACACT 
CTATCATCATTTGGAATGATCATTCCATTCCTTTAGGAAGAATTGATTCC 
ACG.AAAGCATCCACCTCAATACCATCAACCGATCATCATGATGAGTTCCA 
GTC.AAGAGAGCAAACTTTCACAGAAGCACTAAACGCACTCGATCCTAACC 
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ACAAATCCCACATGATAGCCTTATGGGGAATGGGCGGAGTGGGGAAGACG 

ACAATGATGCATCGGCTCAAAAAGGTTGTGAAAGAAAAGAAAATGTTTAA 

mTATAATTGAGGCGGTTGTAGGGGAAAAAACAGACCCCATTGCTATTC 

AATCAGCTGTAGCAGATTACCTAGGTATAGAGCTCAATGAAAAAACTAAA 

CCAGCAAGAACTGAGAAGCTTCGGAAATGGTTTGTGGACAATTCTGGTGG 

TAAGAAGATCCTAGTCATACTCGACGATGTATGGCAGTTTGTGGATCTGA 

ATGATATTGGTTTAAGTCCTTTACCAAATCAAGGTGTCGACTTCAAGGTG 

TTGTTGACATCACGAGACAAAGATGTTTGCACTGAGATGGGAGCTGAAGT 

TAATTCAACTTTTAATGTGAAAATGTTAATAGAAACAGAAGCACAAAGTT 

TATTCCACCAATTTATAGAAATTTCGGATGATGTTGATCCTGAGCTCCAT 

AATATAGGAGTGAATATTGTAAGGAAGTGTGGGGGTCTACCCATTGCCAT 

AAAAACCATGGCGTGTACTCTTAGAGGAAAAAGCAAGGATGCATGGAAGA 

ATGCACTTCTTCGTTTAGAGCACTATGACATTGAAAATATTGTTAATGGA 

GTTTTTAAAATGAGTTACGACAATCTCCAAGATGAGGAGACTAAATCCAC 

CTTTTTGCTTTGTGGAATGTATCCCGAAGACTTTGATATTCTTACCGAGG 

AGTTGGTGAGGTATGGATGGGGGTTGAAATTATITAAAAAAGTGTATACT 

ATAGGAGAAGCAAGAACCAGGCTCAACACATGCATTGAGCGGCTCATTCA 

TACAAATTTGTTGATGGAAGTTGATGATGTTAGGTGCATCAAGATGCATG 

ATCTTGTTCGTGCTTTTGTTTTGGATATGTATTCTAAAGTCGAGCATGCT 

TCCATTGTCAACCATAGTAATACACTAGAGTGGCATGCAGATAATATGCA 

CGACTCTTGTAAAAGACTTTCATTAACATGCAAGGGTATGTCTAAGTTTC 

CTACAGACCTGAAGTTTCCAAACCTCTCCATTTTGAAACTTATGCATGAA 

GATATATCATTGAGGTTTCCCAAAAACTTTTATGAAGAAATGGAGAAGCT 

TGAGGTTATATCCTATGATAAAATGAAATATCCATTGCTTCCCTCATCAC 

CTC.\ATGTTCCGTCAACCTTCGCGTGTTTCATCTACATAAATGCTCGTTA 

GTGATGTTTGACTGCTCTTGTATTGGAAATCTGTCGAATCTAGAAGTGCT 

TAGCTTTGCTGATTCTGCCATTGACCGGTTGCCTTCCACAATCGGAAAGT 

TGAAGAAGCTAAGGCTACTGGATTTGACGAATTGTTATGGTGTTCGTATA 

GAT-\ATGGTGTCTTAAAAAAATTGGTCAAACTGGAGGAGCTCTATATGAC 

AGTGGTTGATCGAGGTCGAAAGGCGATTAGCCTCACAGATGATAACTGCA 

AGGAGATGGCAGAGCGTTCAAAAGATATTTATGCATTAGAACTTGAGTTC 

TTTGAAAACGATGCTCAACCAAAGAATATGTCATTTGAGAAGCTACAACG 

ATTCCAGATCTCAGTGGGGCGCTATTTATATGGAGATTCCATAAAGAGTA 

GGCACTCGTATGAAAACACATTGAAGTTGGTTCTTGAAAAAGGTGAATTA 

TTGGAAGCTCGAATGAACGAGTTGTTTAAGAAAACAGAGGTGTTATGTTT 

AAGTGTGGGAGATATGAATGATCTTGAAGATATTGAGGTTAAGTCATCCT 

CAC.\ACTTCTTCAATCTTCTTCGTTCAACAATTTAAGAGTCCTTGTCGTT 

TCA-\AGTGTGCAGAGTTGAAACACTTCTTCACACCTGGTGTTGCAAACAC 

TTT.\AAAAAGCTTGAGCATCTTGAAGTTTACAAATGTGATAATATGGAAG 

AACTCATACGTAGCAGGGGTAGTGAAGAAGAGACGATTACATTCCCCAAG 

CTG.\AGTTTTTATCTTTGTGTGGGCTACCAAAGCTATCGGGTTTGTGCGA 

TAATGTCAAAATAATTGAGCTACCACAACTCATGGAGTTGGAACTTGACG 

ACATTCCAGGTTTCACAAGCATATATCCCATGAAAAAGTTTGAAACATTT 
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AGTTTGTTGAAGGAAGAGGTAAATATAAATTTTTAATGCTAATACATTAC 
AAAGGATCTTTTCAGTTAAATCTTTCAAAATATATTGTAATTTGATTGTA 
TGGGGTATTATTGTTGGATGGGACTATTAATAAATGATTATCTTGCAGGT 
TCTGATTCCTAAGTTAGAGAAACTGCATGTTAGTAGTATGTGGAATCTGA 

5 AGGAGATATGGCCTTGCGAATTTAATATGAGTGAGGAAGTTAAGTTCAGA 
GAGATTAAAGTGAGTAACTGTGATAAGCTTGTGAATTTGTTTCCGCACAA 
GCCCATATCTCTGCTGCATCATCTTGAAGAGCTTAAAGTCAAGAATTGTG 
GTTCCATTGAATCGTTATTCAACATCCATTTGGATTGTGTTGGTGCAACT 
GGAGATGAATACAACAACAGTGGTGTAAGAATTATTAAAGTGATCAGTTG 

10 TGATAAGCTTGTGAATCTCTTTCCACACAATCCCATGTCTATACTGCATC 
ATCTTGAAGAGCTTGAAGTCGAGAATTGTGGTTCCATTGAATCGTTATTC 
AACATTGACTTGGATTGTGCTGGTGCAATTGGGCAAGAAGACAACAGCAT 
CAGCTTAAGAAACATCAAAGTGGAGAATTTAGGGAAGCTAAGAGAGGTGT 
GGAGGATAAAAGGTGGAGATAACTCTCGTCCCCTTGTTCATGGCTTTCAA 

15 TCTGTTGAAAGCATAAGGGTTACAAAATGTAAGAAGTTTAGAAATGTATT 
CACACCTACCACCACAAATTTTAATCTGGGGGCACTTTTGGAGATTTCAA 
TAGATGACTGCGGAGAAAACAGGGGAAATGACGAATCGGAAGAGAGTAGC 
CATGAGCAAGAGCAGGTAAGGATTTCAATTTCACTGTCTTAATTAATGAT 
TAAGCTCCTGCTTTTTGAATAAAAAAGGGACAAACCATTTCATGACTTAA 

20 TGTAGCAATACAAGTCATGTATAAGAGTGACCAACTCTTTTTTATTTATA 
AAATGACTACAAAATATTTTTTTTCATTAGAGATCATGTATAAATGTGAC 
TAATTTTTCATCACCTAACTTTAGTTGATAAATCTTTATAAATGTCACTA 
GTTACTTTTCAGTAAAATAACAAATTTAATAAATTATCAACAAAAAGCAT 
CAACTAAAAAAATCCCACAACCCGTAATAATTTAAAATAAAAGGATTTAA 

CCAGCAACTCAAGTTTGGAATCGATTCAGCTTAAAACTTGACCAGCATAA 
TTAGATAGATGAGAGTTGAAGCTAAAGTGCCTATATAAGTTCGTTTCATC 
TTTTTTCTTGATCTTGATAGCAAGTTGAATGATTTTCTTCTTCAAAATTG 
ATA.-\AAATCTACATTATAAAGAGACTAGCTTGAAAAAAAATGGTCTAGGT 

30 GGGTCTTGGGTTCTGGTAGATGAAGATGGAAGGGGAGAGTAGATTTCAAA 
GACACAACACATCCTTCATTTTATTTATTTATTATTATTATTATTTm 
ATATCTTGCTCATATTTGTTACAGATATGTGAGGTCTATTAATCTTTTTA 
AATATATAAAAAAATAAATAACATAAATGAGAAAATTAAATAAAGAATAA 
ATTAATAAGGGCACAATAGTCTTTTTAGGTAAGACAAGGACCAAACACGC 

35 AAC.\AAAATAAACAGTAGGGACCATCCGATTTAAAAAAAATAATTAGGGA 
CCAAAAACATAAATTCCCCCAAACCATAGGGACCATTCATGTAATTTACT 

CTAGGTAACGAACTTGTTGAAGTGTTCCCATTTAGGATGTGACCTACTAC 
AACCGATCATAATAGTCATATGTdAACACITCCAACAACnTATTACTTA 
40 GGTGTGTACAAAAAAACAATAGTTACCATGATGTGAACATACTGAAAAAT 
TAATTACCTTAGCAAGTTATTTTCCCATTTAGGTTGTATGGAAACAGTTC 
CGTGAGACCGTGACTTGGATGGTAGATAAATTTAGTAAACTTAACCCTTC 
AATTAACCTACCTTTTTCTTATTAACTCAATTTCAACCTAAATTCTGATT 
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CnTGTTTGAAAGTAAGTTGCATCTTTATTTTTGTATTATOTGTTGCATA 
GGATCCTTAGCATCTTTTAATAATTTATTTGAAGGTGAAAGATCCAACTA 
TTTTTAATCTGTTGGCATTTTCCATCATTTGCAACTGTTTCTTGAAAAAA 
AAATACCTAAAATCAAAATAACCATTTTCAAATCCAAAATTATAAGAGAG 

5 AATTGTAAATGGACATGGAATCATAAATCATTAACACAGTTCAGTAAACA 
AGTTGCTAATTACATTTCTTGCTGTGCAGATTGAAATTCTATCAGAGAAA 
GAGACATTACAAGAAGCCACTGACAGTATTTCTAATGTTGTATTCCCATC 
CTGTCTCATGCACTCTTTTCATAACCTCCAGAAACTTATATTGAACAGAG 
TTAAAGGAGTGGAGGTGGTGTTTGAGATAGAGAGTGAGAGTCCAACAAGT 

10 AGAGAATTGGTAACAACTCACCATAACCAACAACAACCTATTATACTTCC 
CAACCTCCAGGAATTGATTCTATGGAATATGGACAACATGAGTCATGTGT 
GGAAGTGCAGCAACTGGAATAAATTCTTCACTCTTCCAAAACAACAATCA 
GAATCCCCATTCCACAACCTCACAACCATAAAAATTATGTATTGCAAAAG 
CATTAAGTACTTGTTTTCGCCTCTCATGGCAGAACTTCTTTCCAACCTAA 

15 AGCATATCAAGATAAGAGAGTGTGATGGTATTGGAGAAGTTGTTTCAAAC 
AGAGATGATGAGGATGAAGAAATGACTACATTTACATCTACCCACACAAC 
CACCACTTTGTTCCCTAGTCTTGATTCTCTCACTCTAAGTTTCCTGGAGA 
ATCTGAAGTGTATTGGTGGAGGTGGTGCCAAGGATGAGGGGAGCAATGAA 
ATATCTTTCAATAATACCACTGCAACTACTGCTGTTCTTGATCAATTTGA 

20 GGTATGCTTTGTACATATTCAATTATTTATTTAATTTCCTTTTTTATTTG 
CAATATTCTATAAATAATACATTTTATACCCACTATACTAAGATAATAAT 
TACCTAGAGGGATGGATGCTATGACACAGCTGCTACACTTCAGAAACTCT 
AGTAAGGGCAGTTATGGAAGTTCAATAAAATGATAATGGCATCTTTTGAT 
GGGTAATATAGGCAATTTAAGTTTTATTTCTGTTAAAGCAGTATTTAGCA 

25 AGTACTGGCCAGTAGGAGAGGAGAATATCACCTTTTGTGAAAATCTGGTC 
ATTGTACCCAGAATTTAGTTAAATGTAACATTTTAGATATCAGGGGTCAT 
CAGGTGACAGATATTGTAGAATAGAACAATATATAATATCACCCAAAACT 
ATTTITTCTAAGGTTATTCTGTTAAATATGTGCTTTCTTGTTTTCATNGA 
ATTNGCATTCGTATATTTTAGGTGTTAAAGTGATTTTNTCTTCAATAAAT 

30 CCCGAAATTAATTAAAAAAAAAAAAACAAAAGTACATTTTTGATGTGGAG 
AGCACTGGTATCACTTAGTATATAAAAAGCrTGATTTTGAATTAACTTTC 
TTATACAAAAGTTGTGTATATAGTTTAATTAGTTTTACATCATTTTTCCA 
TGTGGTGTTGCAGTTGTCTGAAGCAGGTGGTGTTTCTTGGAGCTTATGCC 
AATACGCTAGAGAGATGAGAATAGAATTCTGCAATGCATTGTCAAGTGTA 

35 ATTCCATGTTATGCAGCAGGACAAATGCAAAAGCTTCAAGTGCTGACAGT 
AAGTGATTGCAAAGGGATGAAGGAGGTATTTGAAACTCAATTAAGGAGGA 
GCAGCAACAAAAACAACAAGAGTGGTGCAGGTGAGGAAGGAATTCCAAGA 
GTAAATAACAATGTTATTATGCTTTCTGGTCTGAAGATATTGGAAATCAG 
CTTTTGTGGGGGTTTGGAACATATATTCACATTCTCTGCACTTGAAAGCC 

40 TGAGACAGCTCCAAGAGTTAAAGATAACATTTTGCTACGGAATGAAAGTG 
ATTGTGAAGAAGGAAGAAGATGAATATGGAGAGCAGTAAACAACAACAAC 
AAC.AACAATAACGAAGGGGGCATCATCATCATCATCTTCTTCATCTTCTA 
AGGAGGTTGTGGTCTTTCCTCGTCTCAAATCCATTGAACTAAATGATGTA 
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CCAGAGCTGGTAGGATTCTTCTTGGGGAAGAATGAGTTCCGGTTGCCTTC 
ATTGGAAGAAGTTACCATCAAGTATTGCTCAAAAATGATGGTGTTTGCAG 
CTGGTGGGTCCACAGCTCCCCAACTCAAGTATATACACACAGAATTAGGC 
AGACATGCTCTTGATCAAGAATCTGGCCTTAACTTTCATCAGGTATATAT 
5 ATTTCTTTAATTGGCATCATCTAATTAAGAAAGATATCATTCCTGCCAAG 
TAA-ATTTACTTCAAACACATTCACACTGGTTTCAGTCTAAGTTTATGTTG 
TTCTAGGAAGGCCAAAATGGGAAAGCAAGATAGGGAAAAATAGTGTATTT 
CAGTGGAAAGGGTATTTTAGGTATTTTCTGTCAAAAGTTGTTATTGCAGG 
CTTTTTAGTACCTGGAATCGTGTGTGGGAGGAGCATTATTATTCTGATTT 

10 GCTTGTTTCTTTATCATTTTTTCTTAGCCTCTGGAACAGCTAGAAACCCT 
TTT.\ATCTTTTGATTTTCAATGACAAAATTTTTCCTGTTACTACATTTGA 
TTGTTGTTCTTCATGGTTCTAAGTGAGTTATTGGCTCATCTGTTACTTCT 
TTTGATTGTTATTTTCATATCATGTTAGTCACnTGAATCAAGCTTTTCTA 
TTTTCAACCAGGGCAAAAGGTCAAAAGTAACCTACTTTATGAGATCAAAA 

15 ACAGCAACCCATCGGATAACTTTTAGTTGGAGTTAATAGTTACAATTACC 
ATTGTGATTAATAATTATAATATCTTGTATTAATTCATAAAAATTGGTAC 
AGCACATATATGACATTTCAAAGGTTTTTGTTTGACATATATATGCCTCT 
GGCGTTTTCTITATTGGACATGCAGACCTCATTCCAAAGTTTATACGGTG 
ACACCTTGGGCCCTGTAACTTCAGAAGGGACAACTTGTTCTTTTCATAAC 

20 TTGATCGAATTATATATGGAATTTAATGATGCTGTTAAAAAGATTATTCC 
ATCCAGTGAGTTGCTGCAACTGCAAAAGCTGGAAAAGATTCATGTGACTT 
ATTGTAATTGGGTAGAGGAGGTATTTGAAACTGCATTGGAAGCAGCAGGG 
AGA.AATGGAAATAGTGGAATTGGTTTTGATGAATCGTCACAAACAACTAC 
CACTACTCTTGTCAATCTTCCAAACCTCAGAGAAATGAAGTTATGGTATC 

25 TAA-\TTGTCTGAGGTATATATGGAAGAGCAATCAGTGGACAGCATTTGAG 
TTTCCAAACCTAACAAGAGTCGATATATGGGGATGTGATAGGTTAGAACA 
TGTATTTACTAGTTCCATGGTTGGTAGTCTATTGCAACTCCAAGAGCTAC 
GCATATGGAACTGCAGTCAGATAGAGGTCGTGATTGTTCAGGATGCAGAT 
GTTTGTGTAGAAGAAGACAAAGAGAAAGAATCTGATGGCAAGACGAATAA 

30 GGAGATACTTGTGTTACCTCGTCTAAAGTCCTTGATATTAAAACACCTTC 
CAWGTCTTAAGGGGTTTAGCTTGGGGAAGGAGGATTTTTCATTCCCATTA 
TTGGATACYTTGGAAATCTACRAATGCCCAGCAATAACCACCTTCACCAA 
GGGAAATTCCRCTACTCCACAGCTAAAAGAAATTGAAACAMATTTTGGCT 
TCTTTTATGCTGCAGGGGAAAAAGACATCAACTCCTCTATTATAAAGATC 

35 AAACAACAGGTAAACCAGATCTTTGTTGCTTNNATAATTCTTAAACNACA 
T^^•GAAAAGCTTCATGCAAGTTTTTTTNGTTATATNGTCAAAAACCGCAA 
CCTACATTTTCAGCTTTANATTTATGTACTTTATGCAGGATTTCAAACAA 
GACTCTGATTAATGTGAAGTGAATATTAAAGGTAAATTATATTTTCATGT 
TCCTAGTNGCCTATTAATTAAAGGCCrmAGTTCGNGATTTTTGGATGT 

40 ATTCTTCATGATGATGTCAATCnTCTAATACCCCATTCATTGTTTGGTTG 
AATGTTGACTCTATGTCAGGATGAATATTCAAGGGAAGAATTGTTCATCA 
TATGAAGGACATTAAAGAACATGGATGCTCTGAAGATGTTGGGAACACA 
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RG2A deduced polypeptide sequence (SEQ ID NO:88) 

MDVWAILKPWETLMW\TaaiIGYUSCRQYMREMGIKMRGLNATRLGVEEHVN 
RNISNQLEVPAQVRGWFEEVGKINAKVENFPSDVGSCFNLKVRHGVGKRASKIIEDI 
DSVMREHSinWNDHSIPLGRmSTKASTSIPSTDHHDEFQSREQTFTEALNALDPNHK 
5 SHMIALWGMGGVGKTTMMHRIiCKVVKEKKMFNFIIEAWGEKTDP^^ 

LGIEU^EKTKPARTEKUOCWFVDNSGGKKILVIUDDVWQFVDLNDIGLSPLPNQGV 
DFK\T.LTSRDKDVCTEMGAEVNSTFNVKMLIETEAQSLFHQFIEISDDVDPELHNIG 
VNI\T?KCGGIPIAIK™ACTUlGKSia)AWKNAUJa.EHYDIENIVNGVF^ 
QDEETKSTFUXGMYPEDFDILTEELVRYGWGimXKVYTIGEARTRLOTaERU 

10 HTNLmEVDDVRCIKMHDLVRAFVIJ>MYSKVEHASIVNHSNTLEW^ 

KRLSLTCKGMSKFPTDUOTNI^nJClMHEDISLRFPKNFYEEMEKLEVISYDm 
PLLPSSPQCSVNLRVFHLHKCSLVMFDCSCIGNLSNLEVLSFADSAIDRLPSTIGKLK 
KUlLIJDLTNCYGVRmNGVUaa.VKI^ELYMTVVDRGRKAISLTDDNCKEMAERS 
KDn'ALEI^FFENDAQPKNMSFEKLQRFQISVGRYLYGDSIKSRHSYENTLKLVLEK 

15 GELLEARMNELFKKTEVLCLSVGDMNDLEDIEVKSSSQLLQSSSFNNLRVLWSKC 
AEUCHFFTPGVANTLKKLEHLEVYKCDNMEELIRSRGSEEETITFPKLKFLSLCGLP 
KI^GLCDNVKIIELPQIJvlEI^lJDDIPGFrSIYPMKKFETFSLLKEEVUPKLEKLHVSS 
MWNTKEIWPCEF>nvlSEEVKFREIKVSNCDKLVNIiTHKPISLlJfflLEEL^ 
ESLFNfflU)CVGATGDEYNNSGVRIIKVISCDKLVNLFPHNPMSILHHLEELEVENC 

20 GSIESLFNIDLDCAGAIGQEDNSISLRNIKVENLGKLREVWRKGGDNSRPLVHGFQS^ 
VESIRVTKCKKFRNVFTPTTTNFNLGALLEISIDDCGENRGNDESEESSHEQEQIEILS 
EKETLQEATDSISNWFPSCLMHSFHNLQKULNRVKGVEVVFEIESESPTSRELVTT 
HHNQQQPnLPNIX2EULWNMDNMSHVWKCSWNKFFnJ»KQQSESPFHNLTT^ 
MYCKSIKYLFSPLMAELI^NLKHIKIRECDGIGEWSNRDDEDEEMTTFTSTHTTTT 

25 LFPSLDSLTLSFLENLKCIGGGGAKDEGSNEISFNNTTATTAVLDQFELSEAGGVSW 
SLCQYAREMRIEFCNALSSVIPCYAAGQMQKLQVLTVSDCKGMKEVFETQLRRSSN 
KNNTCSGAGEEGIPRVNNNVIMl^GlJaLEISFCGGLEHIFrFSALESUlQLQEIJaTFC 
YGNIKVrVKKEEDEYGEQ.TTTTTTITKGASSSSSSSSSKEVVVFPRLKSIELNDVPELV 
GFFLGKNEFRLPSLEEVTKYCSKMMVFAAGGSTAPQLKYIHTELGRHALDQESGL 

30 NFHQTSFQSLYGDTLGPVTSEGTTCSFHNUELYMEFNDAVKKnPSSELLQLQKLEK 
IHVTYCNWVEEVFETALEAAGRNGNSGIGFDESSQTTTTTLVNU>NUIEMKL 
NCUIYIWKSNQWTAFEFPNLTRVDIWGCDRLEHVFTSSMVGSLLQLQELRIWNCSQ 
lEVMVQDADVCVEEDKEKESDGKTNKEILVLPRLKSLILKHLPCLKGFSLGKEDFSF 
PLLDTLEIYKCPAITTFTKGNSTTPQLKEIETHFGFFYAAGEKDINSSIIKIKQQDFKQ 

35 DSD.CEVNK 

RG2B polynucleotide sequence (SEQ ID NO:89) 

TTTTTTAAGATCAGGGATTCAAATTCAGCCCTAGTGATTACAATTGTGTC 

TAA.\CTTTCCCATACCTTCACATTATTGTAAGTATACTTTCTCAGTTTCT 
40 CTCTTGGAAGCTTCCTTGGTATTTTAACTCGTGTTCTAATATTTAACTCT 
GATAGTTATTTTGGCCAATCTACTATCTGCATGTCCGGTTATTGAATCCG 
AAGGCACTGGAATCTTGGATTCCATTCCGTTGTGTGTTTGGTTGCCAAAT 
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GAACGGAATTGAATTATGTAAGATTCCTTCAAAATCCATGTTTAGGTATA 
TCGTTGTTTCTTGGGATGGATGGTAAAGAACGGAATTTCTCCTGTTCATT 
TTTTAATGAAAGACCAAATTGACCTTATAAACCTGTTAAAAAAATTACAT 
TCCAGTTTTCTTAACAAACTGAAAATGGTAAAGGAGTGTGATTGAATTCC 
5 AATCTGTTTCCTGTCCAAAACACGTGACGGAATATTACAATTCCTTCAAA 
TTTCATTTTCTTAAATTGTTATTCCCTTTCTTACAAAAACAAGGTAAACG 
AAACACCCGCTTACTTAATCATACTCCTACATGATGTAAATGAAAAGGGT 
ATAAATGGTATTTTATTCACAGGGATGAGTCACCATGGTCATGAAAGAAT 
CATTAACCGCCCTTACCCAATTCATGTTTGCCCCTAAAATATGATTTAAA 

10 GTAATATTGGCTTATGGGATTCAAGTTGACTTTTTTGTGGCGAAGAAATA 
ATG.WlATCTTCATTTCTAAAGTGTCTTCTACCACTGACATTTTCTAAGA 
AAGAACTTGCTAGAAGAAGGTGGGTTGTTTAGTCTrTTTACTCTTTAAAT 
GTGAAGACTGTTGAGTTATTATTATTATTTTGCCAACTATGGACAACTTG 
TTTAGTTTTTTTTTTTCCCCAATATCCATTTATATGCGATTTATTTCTGA 

15 AAT.AATTTTATCAAAACGCAGGAAACAATGTAGAATAATACTGGTATAAT 
TAATTATATAAAGTTATTAGGCTGAAATCTTGAGGCTACTATAATTTAAT 
TATCATAATTTGAAAATCATCAAA'h'GTATTCCATGTATATTTATGTTAT 
CAGATAATTAATAATATGTGAGCCACACAAATCCACATCATCAGACACCC 
CACCTTATTGTCGGCTACCTCACCACTTGCATGATCCCGACATCTTCCCA 

20 ACCCCACCGACGACTTGGGGTCTCCTTAATATATCAATTATTTTCTGTAA 
GTATTTATTTGTGTAAATGTGTAATGTCATTTTACCTTTTTTCTAATATA 
TACAGAAACATAAATTTTAAATGAAATTCAACTGCGTTTCATTCTTGCAT 
TAA.\AAAAAAGACTGTACTGTTGTCAATATTTTACTTATAACCTGATTAA 
TTA.ATTAAAGCGTAATTGCATAATTTGCATTAGGTTGTAATTTTGTGTTT 

25 TATAGGGAGGGTGAGGGTCACCGGGAATCAAAGCACTTATGTAAAAGCAG 
GGA.\ATACAAAAAATTTACTCGAAACAAATTTTATTCAATTTAAGTGAGA 
TAATAATGTTCTGATTAGATTATGAGAACTAGGAGATTTAAGTGATATAT 
CCCATTTAAAAGAAATTGCATTATTAATTTTGGATCTCTTGATGATGACA 
AAATTAACTCGTGACAGGTTATATATCATATACAAAATGAGTGGCTATGC 

30 TTTCGCTTTCCAAAAAGCAATTATAGTTATACTACACCTACAAATTTTAA 
AAGGGGTTAAACATATCAAAATACTTGATAAGTAATTATATAAATATGCA 
TTTAACCCTCTAAAGAAAATGCTACTAAGCTTGGACCATCTCAGAATTAC 
AATCATACCCTTCCCCTCAAAAAAGATTCGTATATATCATGTCATTTGGC 
ATTCATTTCTTTTTCACAATTCATAGTTCTATTCTCAAAAAATTCGAGTT 

35 CTCGTATTTGTAAGGAAGATCAGAAGAGACTGTTCACACAGGTACTCTCT 
TTTATTTATTGATTCACATTCATATATGTTATTGTTTTCTTGCTTAATGG 
TTTCGTCAGTCTAACTGCGCTTGCTGATTTAAATTTCTTCACTTTCTTCC 
ACGGATTTTTTAAATATTAGTTTTGTGAATGAACAATTGGTGAAGGAAAG 
AAACATGGGAGTCTTTTCTAAAGTAAACCTAGATACTTAGGTTATAAGGG 

40 TATATGCTAAAATGAACTATGCCCATTCACCTTTGCCTTTTCTTTTACTT 
TTTAGTTTTTAGAATCCAAGTTTTCATATGTATCTCGATGTGTGAGAAGA 
ATAGGCATTAGAAAGGTAAAGGACGTACATAAAATTGATTAATTAGTGAA 
TGTTCTTTGATATCATTATTTTTACTCTCATAAAAAGCATATAGATCAAA 
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CAC.\AATTGCTACTTGTTAGTGTAACAACTTCGACTTAATAATGTTAATA 

ATC.\AGATTCTCTTGArrTCAACTATTTTCTAACCGAACAAGCTCACTAA 

AAACTCATATTGCTTTGAGTCTGAGTGGTTTATATTTGGGGTTTTACATT 

TAATTTTTTGTGCATGAATGTGAAAATAGACTGCTTATTGATrCTTTGTG 

TTTCATTGAGTTGATTTTCATTATTACTACCTTACAAATTGCTCAGTGAT 

AGATTTCCATTAATTTGCTAATTCGGTTGCTTCTAAATATGTAGGAGCTA 

CTA.\AAGCAAAAATATCGAGCAATGTCGGACCCAACGGGGATTGCTGGTG 

CCATTATTAACCCAATTGCTCAGACGGCCTTGGTTCCCG1TACGGACCAT 

GTAGGCTACATGATTTCCTGCAGAAAATATGTGAGGGTCATGCAGATGAA 

AATGACAGAGTTGAATACCTCAAGAATCAGTGTAGAGGAACACATTAGCC 

GGAACACAAGAAATCATCTTCAGATTCCATCTCAAACTAAGGAATGGTTG 

GACCAAGTAGAAGGGATCAGAGCAAATGTGGAAAACTTTCCGATTGATGT 

CATCACTTGTTGTAGTCTCAGGATCAGGCACAAGCTTGGACAGAAAGCCT 

TCA.\GATAACTGAGCAGATTGAAAGTCTAACGAGACAACTCTCCCTGATC 

AGTTGGACTGATGATCCAGTTCCTCTAGGAAGAGTTGGTTCCATGAATGC 

ATCCACCTCTGCATCATTAAGTGATGATTTCCCATCAAGAGAGAAAACTT 

TTACACAAGCACTAAAAGCACTCGAACCCAACCAAAAATTCCACATGGTA 

GCCTTGTGTGGGATGGGTGGAGTGGGGAAGACTAGAATGATGCAAAGGCT 

GAAGAAGGCTGCTGAAGAAAAGAAATTGTTTAATTATATTGTTGGGGCAG 

TTATAGGGGAAAAGACGGACCCCTTTGCCATTCAAGAAGCTATAGCAGAT 

TACCTCGGTATACAACTCAATGAAAAAACTAAGCCAGCAAGAGCTGATAA 

GCTTCGTGAATGGTTCAAAAAGAATTCAGATGGAGGTAAGACTAAGTTCC 

TCATAGTACTTGACGATGTTTGGCAATTAGTTGATCTTGAAGATATTGGG 

TTA-\GTCCTTTTCCAAATCAAGGTGTCGACTTCAAGGTCTTGTTGACATC 

ACGAGACTCACAAGTTTGCACTATGATGGGGGTTGAAGCTAATTCAATTA 

TTAACGTGGGCCTTCTAACTGAAGCAGAAGCTCAAAGTCTGTTCCAACAA 

TTTGTAGAAACTTCTGAGCCCGAGCTCCAGAAGATAGGAGAGGATATCGT 

AAGGAAGTGTTGCGGTCTACCTATTGCCATAAAAACCATGGCATGTACTC 

TTAGAAATAAAAGAAAGGATGCATGGAAGGATGCACTTTCGCGCATAGAG 

CACTATGACATTCACAATGTTGCGCCCAAAGTCTTTGAAACGAGCTACCA 

CAATCTCCAAGAAGAGGAGACTAAATCCACTTTTTTAATGTGTGGTTTGT 

TTCCCGAAGACTTCGATATTCCTACTGAGGAGTTGATGAGGTATGGATGG 

GGCTTGAAGCTATTTGATAGAGTTTATACGATTAGAGAAGCAAGAACCAG 

GCTCAACACCTGCATTGAGCGACTGGTGCAGACAAATTTGTTAATTGAAA 

GTGATGATGTTGGGTGTGTCAAGATGCATGATCTGGTCCGTGCTTTTGTT 

TTGGGTATGTTTTCTGAAGTCGAGCATGCTTCTATTGTCAACCATGGTAA 

TATGCCTGGGTGGCCTGATGAAAATGATATGATCGTGCACTCTTGCAAAA 

GAATTTCATTAACATGCAAGGGTATGATTGAGATTCCAGTAGACCTCAAG 

TTTCCTAAACTAACGATTTTGAAACTTATGCATGGAGATAAGTCGCTAAG 

GTTTCCTCAAGACTTTTATGAAGGAATGGAAAAGCTCCATGTTATATCAT 

ACGATAAAATGAAGTACCCATTGCTTCCTTTGGCACCTCGATGCTCCACC 

AACATTCGGGTGCTTCATCTCACTGAATGTTCATTAAAGATGTTTGATTG 

CTCTTCTATCGGAAATCTATCGAATCTGGAAGTGCTGAGCTTTGCAAATT 
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CTCACATTGAATGGTTACCTTCCACAGTCAGAAATTTAAAGAAGCTAAGG 
TTACTTGATCTGAGATTTTGTGATGGTCTCCGTATAGAACAGGGTGTCTT 
GAA.\AGTTTTGTCAAACTTGAAGAATTTTATATTGGAGATGCATCTGGGT 
TTATAGATGATAACTGCAATGAGATGGCAGAGCGTTCTTACAACCTTTCT 
5 GCATTAGAATTCGCGTTCTTTAATAACAAGGCTGAAGTGAAAAATATGTC 
ATTTGAGAATCTTGAACGATTCAAGATCTCAGTGGGATGCTCTTTTGATG 
AAAATATCAATATGAGTAGCCACTCATACGAAAACATGTTGCAATTGGTG 
ACCAACAAAGGTGATGTATTAGACrCTAAACrrAATGGGTTATTTTTGAA 
AACAGAGGTGCTTTTTTTAAGTGTGCATGGCATGAATGATCTTGAAGATG 

10 TTGAGGTGAAGTCGACACATCCTACrCAGTCCrCTTCATTCrGCAATTTA 
AAAGTTCTTATTATTTCAAAGTGTGTAGAGTTGAGATACCTTTTCAAACT 
CAATCTTGCAAACACmGTCAAGACTTGAGCATCTAGAAGTTTGTGAAT 
GTGAGAATATGGAAGAACTCATACATACTGGAATTGGGGGTTGTGGAGAA 
GAGACAATTACTTTCCCTAAGCTGAAGTTTTTATCTTTGAGTCAACTACC 

15 GAAGTTATCAAGTTTGTGCCATAATGTCAACATAATTGGGCTACCACATC 
TCGTAGACTTGATACTTAAGGGCATTCCAGGTTTCACAGTCATTTATCCG 
CAG.\ACAAGTTGCGAACATCTAGTTTGTTGAAGGAAGGGGTAGATATATG 
TTCTTTATGTTAATACAATTTAAATJ^TATTTTCAACCAAATTTTCATAA 
TATATCTGTAATTTGATTGTATGATGTGTTATTGTTTATATGTGGCTATT 

20 AAGGGATGATTATTTTGCAGGTTGTGATTCCTAAGTTGGAGACACTTCAA 
ATTGATGACATGGAGAACTTAGAAGAAATATGGCCTTGTGAACTTAGTGG 
AGGTGAGAAAGTTAAGTTGAGAGCGATTAAAGTGAGTAGCTGTGATAAGC 
TTGTGAATCTATTTCCGCGCAATCCCATGTCTCTGTTGCATCATCTTGAA 
GAGCTTACAGTCGAGAATTGCGGTTCCATTGAGTCGTTATTCAACATTGA 

25 CTTGGATTGTGTCGGTGCAATTGGAGAAGAAGACAACAAGAGCCTCTTAA 
GAAGCATCAACGTGGAGAATTTAGGGAAGCTAAGAGAGGTGTGGAGGATA 
AAAGGTGCAGATAACTCTCATCTCATCAACGGTTTTCAAGCTGTTGAAAG 
CAT.\AAGATTGAAAAATGTAAGAGGTTTAGAAATATATTCACACCTATCA 
CCGCCAATTTTTATCTGGTGGCACTTTTGGAGATTCAGATAGAAGGTTGC 

30 GGAGGAAATCACGAATCAGAAGAGCAGGTAACGCTTTCAATTTCACTTTC 
TTA.\TTAATTAAGGACTAAGCTCCTGTTTTTTGAATAATAAAGAGGTGGG 
ATGACTAAACTTGGGCATCACAATTGCAACAAAATGTTACAAACCATGAA 
ACGTTCAAACCATTTCTTGAATTAAGGTTTCAATACAAGTCATTTAAAAA 
TATGGCTTAAATTTTTTTTATATTTATGTATCAACATGATTTTTCATTAG 

35 AGATCATTATTATAATAGTAAGTTTAAAGCAATTTAAATCAGAACTAATT 
CTAACTTTAGCTAATAAATCGTTATAAATGTAAATAATTACTTTTTAGTG 
AAATAAGCAACGGATTTAATAAGTTAACAACTTAAATGTCATTTCCTAAC 
AAAAAAAACTATTTGGTTCAGAAAAACCGTAATTCAAGATAACTAAAATA 
AAAATATTTGACATTCACTAAGAGCATTITITITTCTAAATATGATTGCA 

40 AATGAATAAAACTTAAATTTATACAGAAAATTCTTTTATATATGTTATAC 
AAAATTTACAAATTGAAATTGGATATGTTAATTAACGGTTTATAATTCTG 
GTATCACAAAGGGATATATAATAAAATATTATTTTCTGTAGTCATTTGTA 
ATTGTACTAGTTTATAACCCGTGGGAACCATGAGTTCTAAAATTAGTTAA 
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ACTTTCATAATAAAAATTTATAATTATTATTTATTTTAAATAAATTATTA 

ATTAAGAGATATATCAAAAATTTAAAGTTATTATAACTTCAAATTTAACA 

TATAATTAGAAAATATATGATCATAACTTTCTGCAACTCTTCTTTTGTAT 

TAAAATGACCAGAGAAGCTCTTAGTATATTTCTAATCAAAGTCTCAAAMC 

TAATGAAGCATATAATTTGTGAAAATCAATTAGCATTAGGTTTTAAGAGT 

CACCAAATTCAAAGAATAATCCAATGCTTTCATTACCACTATGGAGAAAA 

TATTTTCTTAGTTTAAATGAAATGAAAACAAACATTGAAACTAATTGTTG 

CTTATTAAACCAAAGACCCATTACTTAGCCAAGAGTTTAACAAAAAAAAA 

TTACATTCATGTATCATTATTCATGACTAGATATATATGAACATGAAGGG 

AGTTTTTATAGAAAATATAATCATAGATATTCAACATAACTTCAGGGAAT 

TCCTCAAAATAACCAAGTTATTCAAGAAATTACATCCAAGTCAACCAAAG 

AGAAGTTTAGCCTAGCATGGCTAAACTCAAGAAACTAAAATAAGGATTAG 

AAGTACCAAACATGTAGTAAGAATCACAGTAAAAGATGATGTTGTTCTTG 

ATGTTCTTCTAAGTTCTTCAAGTCTCCAGTTGCTCCTAATAATGCAAAGG 

AGAGCCATTAAATTCGTATGTATTGATCCCTTCAAAAGCTGCACCAACCT 

CCCTTAAATAACACTCAAAGCAAAAATGACAAAATTGCCCCTGAAGGACC 

CTATGTGGGTGCCTTGCGCGGGTGGAGCTGCATACGAAAGGTCTTTGGTC 

nTGTGAGGGTGATGTTGTGCGGGATAGCTTGTCGCATGCTTCCGCGCGG 

TTCACGCACATGTGCACAGGTGATGCATGGTGTGTGCGTTCTTGAGTTTT 

GAGCCTCCGATGCTTAGTCCACTTGGCCCAATTCGAGTCCAATCAGCTTA 

TAACCCATTTTTCTTCAAGTTATCTTCAAGTTAAGCCCAATTTGCCTTCT 

CCA-AATCATCCATAACTTCACAGAATCGCCCGTTCATCTTAATCCCGGAT 

GCACAATTATTCTCCCGTCTTCATTTTAAGCAAGATACCACCTTCTTCAT 

GCTTCATCCATCAATAGTACACTTCATGTATCATCTCTACTAGTTATTTA 

GTCCACAATCCTTGTTGTCCTCCAAATTTAATTATCTCATTTAGTTCCCG 

TTCCGCTAGTTTCCTTAAAATTTGCAATTAAGCTCAGAGAAATATTAAGT 

ACCCGAAATGGTCATAAAATAACAAAAAGGAAAATATGCATGAAGATTAA 

CTA-\ATGATGAACGAAATATGCTAAAATAGACTATAAAATGAAGTAAATA 

AAATGAAATTATCGCACTCCGACCACCCTTATAGGCTTGTAGTCCACCCA 

CCCTTCATTCCTTGTACCAATATGGGATGGAAACATCATTAATTAAGCCA 

AAAAGCTAACATATAAGGGTTTAGTGACAAAGGTAAGTACTAAAGATGAA 

AAT.\ATCCATTTTTCTTGTATATACACAACACACACATAGGGGCAGACGT 

AGGATTTCAAAGTACAGATTGTTGGTGGCACATAAGTGTTGCTGGTGACA 

TTTTTTTTTTTTTTTTACGTAGTGGCACAACAGTAGGAAAAACGAAAAAT 

TCGAAATTTTTTACAATTTGTCTAAAAAAAACAGTGGTTGTTGGTGCCAC 

TATGGACACCAAAGTTGAACTGCCCCCACGCGCGCACACACACACACACA 

CATAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAGAAAGAAAGAGAGA 

GAGAGTTTGGGATGTGATACTTCTnTAGGAAAATGGAGTTATATCTTTG 

ATATTGTATTTTmAATGTAATTTATATATTTAATCATTTTAGTTTATA 

AGTTTTATTTATTTTGATATGAAAAAAAAAGTCTTTTATACATTGGATTT 

AACATAAAAATCCAACAATATTAATCAAAAAGACCAMACATGTGGACAMW 

TATGTATATAAWTAATTCACAATAGTCTTTAGGAATAGNATTATATATAT 

AATTAATTCTCAATGGTCTTAGGAATAGTAAGTTCTTATATTTCAAACTT 
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TNGCCACAATTCTTTGKTTACTTWGACACTTYCCTCTCTCTAATTATATA 
TATATATATATATATATATATATATATACACACACACACACACACACTAG 
ATGTGTGCCCGCGCAAAGCAGTGACGTNNNGGAGAANACTTTCTTAAGCA 
TAAATAATTATTATATTTTTTATTGGGTATTATATAATAAAAAATTACAA 

5 CTTTTAAATAAAATATTTATGTTTATACTTTATATTTATATTGCTTGTAT 
ACTATTAATATAATAAATTAATATTTATGTCTAATTTATGAAATGTAAAT 
TAATTTAAATACATGAATTTAATATTTTTAAAATTTTCAGTTTGCTTCAA 
ATTGAGTTTCTTAATTATTTTTTTTAATTCANGTATTCAAACTTTTGGTA 
AGTATTAAAGAATTATTTATGCATAATTGATTTATACAAAAAACTTTGTA 

10 ACTTATACATCTTAAAATTCAAGATATAACTAACATGTTTTACAATATAT 
ATATATATATATATATATATATATATATATATATATATATATATATATAT 
TAAAGCGCAAAGGTCATAGGAATAGAATATTTTCTATTATTCTACGTTTT 
GCCACAAAAGTTTGAACACTTTGCCACTTTTTGTCCCTCCnTAACCnT^ 
CAATGTTTTGCGACAAAAGTTCCAAAACTTTGCCACTTTGATCATTCCTC 

15 AACTTTTCACCGCAATTAGTTTGTGGAGTTGGCAGTTTTGATCCCCCTAA 
CTTCGATATTCTCTACTGCTAGCCAAAAAGGGTTCCAGAGTTTCACACTT 
TTGGTCCCTGACAGTAACCAAATGTGAGATGTCAAATTTTTGCCACATTA 
GTTTGTGGAGTTGTCCCmTGGTCCCCCCACATTCGATATTCTANTATA 
CGACCTTATTTTtNTCAAATAACAACACGTATATTTAATTACCAATTATA 

20 GAA.ATAGATATCAAATAAAGTATTTGTAACACTGTGTAAGAACGGTGCTA 
CTATAGGTAAAAATAAACATTTCAAAGTACGATATCCTAATTGGAAAAAG 
AGTTTTAAAAAAATAACGACTAGGGGCGAGTTTTTTTTACAAGTTTGTAT 
CAA.\TCATATCAAAATTTAAGGTGGAACGGTGACCACATTAACCAGAAAT 
GTAATTTATTCTTTGATTTTGATAATTTTTAATATTTTGTTGTGATCTAT 

25 GTATTTAAAAGTAAACAACAAAGAACATAATCCAAAACCCTAAATTGCAA 
GTCTCGCCCAATTTCTCTATCACTAGTCCTCACTTACGATGGCGTTACGT 
CGCTCTCTCACTGCTTACAACCCTTTGTTGCTACTCATTACAATAACGAA 
AAGTTGAATATCCATATATTTATTTGGATGTGGAATTGAACGAATCTCGT 
CAAAATTTTGATTTTGTTGATGGATTTGAGTAGAAGTTTGGGCAGAACGG 

30 GAATGATGGTCTGCAAGTGGTTATAAACTTGATTCTGAGTTATTACTATA 
TATGTAGCCrCTTTACAACGACCAAGGTTTCTTCCAGGTACCATTTGATC 
TTTTTAGAACTTAGTTTTCTGAAACACCCTGATTTGGATCAAATATCACC 
AAC.AACTCTTAAAAACTTGATTAATCAATTGTTTTCTTCATCTTGATAAC 
AAGTGGAATGATTTTCTACTTAGATTAACTTGAAAAAAAAGGTCCATGTG 

35 CGTCTGGTGGATCTGGTAAATGAAGATGGAAGGGAGAGCTGACTTTAAAG 
ACACAAACACGTCACCATATCTCTTATTTTATTTTAAATTTGCTTTTGGT 
GTATTTTCTTTTTTCCTATTTCTTTCTTTCTTGATCTCCAGATGGTATGT 
GGTGTGGATAATTTACACCTAGAGATTGGGAACGATGGGAAGGGGTCTGT 
GATTTATGGCTGGCCGAGTTTTACTTATTAACTCAATTTCAACCTAAATT 

40 CTGATTCTTGTTTGAAAATAAGTTGCATCTTTATTTTTGTATTATCTTGT 
TGCATAGGATCCTTAGCATCTTTTAATAATTTATTTGAAGGTGAAAGATC 
CAACTATTTTTTAGCTGTTGGCATTTTCCATCATTTGCAACTGTTTCTTG 
AAA-AAAAAATACCTAAAATAAAAATAACCATTTTCAAATCCAAAATTATA 
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AGAGAGAATTGTAAATGGACATGGAATCATAAATCATTAACACAGTTCAG 

TAAACAAGTTGCTAATTACATTTCTTGCTGTGCAGATTGAAATTCTATCA 

GAGAAAGAGACATTACAAGAAGCCACTGGCAGTATTTCAAATCTTGTATT 

CCCATCCTGTCTCATGCACTCTTTTCATAACCTCCGTGTGCTTACATTGG 

ATA-^TTATGAAGGAGTGGAGGTGGTATTTGAGATAGAGAGTGAGAGTCCA 

ACATGTAGAGAATTGGTAACAACTCGCAATAACCAACAACAGCCTATTAT 

ACTTCCCTACCTCCAGGATTTGTATCTAAGGAATATGGACAACACGAGTC 

ATGTGTGGAAGTGCAGCAACTGGAATAAATTCTTCACTCTTCCAAAACAA 

CAATCAGAATCCCCATTCCACAACCTCACAACCATAAATATTCTTAAATG 

CAAAAGCATTAAGTACTTGTTTTCGCCTCTCATGGCAGAACTTCTTTCCA 

ACCTAAAGGATATCCGGATAAGTGAGTGTGATGGTATTAAAGAAGTTGTT 

TCA.AACAGAGATGATGAGGATGAAGAAATGACTACATTTACATCTACCCA 

CACAACCACCACTTTGTTCCCTAGTCTTGATTCTCTCACTCTAAGTTTCC 

TGGAGAATCTGAAGTGTATTGGTGGAGGTGGTGCCAAGGATGAGGGGAGC 

AATGAAATATCTTTCAATAATACCACTGCAACTACTGCTGTTCTTGATCA 

ATTTGAGGTATGCTTTGTACATATTCAATTATTTATTTAATTTCCTTTTT 

TATTTGCAATATTCTATAAATAATACATTTTATACCCACTATACTAAGAT 

AAT.^ATTACCTAGAGGGATGGATGCTATGACACAGCTGCTACACTTCAGA 

AACTCTAGTAAGGGCAGTTATGGAAGTTCAATAAAATGATAATGGCATCT 

TTTGATGGGTAATATAGGCAATTTAAGTTTTATTTCTGTTAAAGCAGTAT 

TTAGCAAGTACTGGCCAGTAGGAGAGGAGAATATCACCTTTTGTGAAAAT 

CTGGTCATTGTACCCAGAATTTAGTTAAATGTAACATTTTAGATATTAGG 

GGACATCAGGTGACAGATATTGTAGAATAGAACAATATATAATATTACCC 

AAAACTATTTTTTCTAAGGTTATTCTGTTAAATATGTGCTTTCTTGATTT 

CATTGAATTTGCATTCCTATATTTTAGGTGGTAAAGTGATTGTCTCTTCA 

ATA.\ATCCCGAAATTAATTAAAAAAGAAAAAAACAAAAGTAAATTTTTGA 

TATGGAGAGCACTGGTATCATTTAGTATATAAAAAAACTAGATTTTGAAT 

TAAGTTTCTTATATAAAAGCTGTGTATATAGTTTAATTAGTTTTACATCA 

TTTTTCCATGTGGTGTTGCAGTTGTCTGAAGCAGGTGGTGTTTCTTGGAG 

TTTATGCCAATACGCTAGAGAGATARAAATAGKTGGATGCTATGCATTGT 

CAAGTGTGATTCCATGTTATGCAGCAGGACAAATGCAAAAGCTTCAAGTG 

CTGAGAATAGAGTCTTGTGATGGCATGAAGGAGGTATTTGAAACTCAATT 

AGGGACGAGCAGCAACAAAAACAACGAGAAGAGTGGTTGCGAGGAAGGAA 

TTCCAAGAGTAAATAACAATGTTATTATGCTTCCCAATCTAAAGATATTA 

AGTATTGGAAATTGTGGGGGTTTGGAACATATATTCACATTCTCTGCACT 

TGAAAGCCTGAGACAGCTCCAAGAGTTAAAGATAAAATTTTGCTACGGAA 

TGAAAGTGATTGTGAAGAAGGAAGAAGATGAATATGGAGAGCAGCAAACA 

ACAACAACAACAACGAAGGGGGCATCTTCTTCTTCTTCTTCTTCTTCTTC 

TTCTTCTTCTAAGAAGGTTGTGGTCTTTCCTTGTCTAAAGTCCATrGTAT 

TGGTCAATCTACCAGAGCTGGTAGGATTCTTCTTGGGGATGAATGAGTTC 

CGGTTGCCTTCATTAGATAAACTTAAGATCAAGAAATGCCCAAAAATGAT 

GGTGTTTACAGCTGGTGGGTCCACAGCTCCCCAACTCAAGTATATACACA 

CAAGATTAGGCAAACATACTCTTGATCAAGAATCTGGCCTTAACTTTCAT 
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CAGGTATATATATATTTCTTTAATTGGCATCATCTAATTAAGAAAGATAT 
CATTCCTGCCAAGTAAATTTACTTCAAACACATTCACACTGGTTTCAGTC 
TAAGTTTATGTTGTTCTAAGAAGGCCAAAATGGGAAAGCAAGATAGGGAA 
AAATAGTGTATTTCAGTGGAAAGGGTATTTTAGGCATTTTCTGTCAAAAG 
5 TTGTTATTGCAGGCTTTTTAGTACCTGGAATCGTGTGTGGGAGGAGCATT 
ATTATTCTGATTTGCTTGTTTCTTTATCATTTTTTCTTAGCCTCTCGAAC 
AGCTAGAAACCCTTTTAATCTTTTGATTTTCAATGACGAAATTTTTCCCT 
GTTACTCCATTTGATTGTTGTTCTTCATGGTTCTAAGTGAGTTATTGGCT 
CATCTGTTACITCTTTTGATTGTTATTTTCATATCATGTTGTCCTTTGAA 

10 TCAAGCTTTTCCATTTTCAACCAGGGCAAAAGGTCAAAAGTAACCTACTT 
TATGAGATCAAAAACAGCAACCCATCGGATAACTTTTAGTTGGAGTTAAT 
AGTTACAATTACCATTGTGATTAATAATTATAATATCTTGTATTAATTCA 
TAA.'SiAATTGGTACAGCACATATATGACATTTCAAAGGTTTTTGTTTGACA 
TATATATGCCTCTGGCGTTTTCTTTATTGGACTTGCAGACCTCATTCCAA 

15 AGTTTATACGGTGACACCTTGGGCCCTGCTACTTCAGAAGGGACAACTTG 
GTCTTTTCATAACTTTATCGAATTAGATGTGGAAGGTAATCATGATGTTA 
AAAAGATTATTCCATCCAGTGAGTTGCTGCAACTGCAAAAGCTGGAAAAG 
ATT.AATGTAAGGTGGTGTAAAAGGGTAGAGGAGGTATTTGAAACTGCATT 
GGAAGCAGCAGGGAGAAATGGAAATAGTGGAATTGGTTTTGATGAATCGT 

20 CAC.^AACAACTACCACTACTCTTGTCAATCTTCCAAACCTTAGAGAAATG 
AACTTATGGGGTCTAGATTGTCTGAGGTATATATGGAAGAGCAATCAGTG 
GACAGCATTTGAGTTTCCAAACCTAACAAGAGTTGATATCTATAAATGTA 
AAAGGTTAGAACATGTATTTACTAGTTCCATGGTTGGTAGTCTATCGCAA 
CTCCAAGAGCTACATATATCCAACTGCAGTGAGATGGAGGAGGTGATTGT 

25 TAAGGATGCAGATGATTCTGTAGAAGAAGACAAAGAGAAAGAATCTGATG 
GGGAGACGAATAAGGAGATACTTGTGTTACCTCGTCTAAACTCCTTGATA 
TTA-\GAGAACTrCCATGTCTTAAGGGGTTTAGCTTGGGGAAGGAGGATTT 
TTCATTCCCATTATTGGATACTTTAAGAATTGAGGAATGCCCAGCAATAA 
CCACCTTCACCAAGGGAAATTCCGCTACTCCACAGCTAAAAGAAATTGAA 

30 ACACATTTTGGCTCGTTTTGTGCTGCAGGGGAAAAAGACATCAACTCTCT 
TATAAAGATCAAACAACAGGTAAATCAGATCTTTGTTGCTTTAATAATTC 
TTAAACTACATTTGAAAAGCTTCATGCAAGTTTTTTTTGTTATATTGTCA 
AAAACCGCAACCTACATTTTCAGCTTTATATTTATGTACTTTATGCAGGA 
GTTCAAACAAGACTCTGATTAATGTGAAGTAAATACTAAAGGTAAATTAT 

35 ATTTTCATGTTCCTAGTTGCCTATTAATTAATTGCCTTTTAGTTCATGAT 
TTTTGGATGCATTCTTCATGATGATGTCAATCTTCTAATACCCCATTCAT 
TGTTTGGTTGAATGTTGACTCTATGTCTTGATGAATATTCAAGGGAAGAA 
TTGTTCATCATATGAAGGACATTAAAGAAGAACATGGATGCTATGAAGAT 
GTGGGAAAACAA 

40 
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RG2B deduced polypeptide sequmce (SEQ ID NO:90) 

MSDPTGIAGAIINPIAQTALWVTDHVGYMISCRKYWVMQMKMTELNTSRISVEE 
HISRNTRNHLQIPSQTKEWLDQVEGIRA^WENFPIDV^^CCSIJma^CLGQKAFKITE 
QIESLTRQLSUSWTDDPVPLGRVGSMNASTSASLSDDFPSREKTFTQALKALEPNQK 
FHMVALCGMGGVGKTRMMQRIiCKAAEEKKLFNYIVGAVIGEICrDPFAIQEAIADY 

LGIQLNEKTKPARADKUIEWFKKNSIXK}KTKFIJVU)DW 

GVDFKVLLTSRDSQVCTMMGVEANSnNVGLLTEAEAQSLFQQFVETSEPELQKIGE 

DIVRKCCGLPIAIKTMACTLRNKRKDAWKDALSRIEHYDIHNVAPKVFETSYHNLQ 

EEETKSTFUvICGIJ'PEDFDIPTEELMRYGWGmJDRVYTIREARTRLKrCIERLVQ 

TNUJESDDVGC^^HDLVRAFVLGMFSEVEHASIVNHGNMPGWPDENDMIVHSC 

mSLTCKGMimPVDUCFPKLTIIJaJvlHGDKSUa^>QDFreGMEKLHVISYDKMKY 

PLLPLAPRCSTNmVLHLraCSLmFDCSSIGNI^NLE\T:^FANSfflEWIJ>STVRNLK 

KUILLDLRFCDGUUEQGVIJCSFVKLEEFYIGDASGFIDDNCNEMAERSYNLSALEF 

AFFNNKAE\TCNMSFENLERFKISVGCSFDENINMSSHSYENMLQLVTNKGDVLDSK 

LNGmiCTE\a.Fl^VHGMNDLEDVE\^THPTQSSSFCNLKVLIISKCVELRYLFKL 

NLANTLSRLEHLEVCECENMEELIHTGIGGCGEETITFPKLKFLSLSQLPKLSSLCHN 

VNHGLPHLVDLILKGIPGFTVIYPQNKLRTSSLLKEGVVIPKLETLQIDDMENLEEIW 

PCELSGGEKVKLRAIKVSSCDKLVNLFPRNPMSLLHHLEELTVENCGSffiSLFNIDLD 

CVGAIGEEDNKSLLRSINVENLGKLREVWRIKGADNSHLINGFQAVESIKIEKCKRFR 

NIFTPITANFYLVALLEIQIEGCGGNHESEEQIEILSEKETLQEATGSISNLVFPSCLMH 

SFHNLRVLTLDNYEGVEVVFEIESESPTCRELVTTRNNQQQPIILPYLQDLYLRNMD 

hrrSHVWKCSNWNKFFTLPKQQSESPFHNLTTINIUCCKSIKYlJ'SPIJvlAELl^NUQDI 

RISECDGKEWSNRDDEDEEMTTFTSTHTTTTLFPSLDSLTLSFLENLKCIGGGGAK 

DEGSNEISFNNTTATTAVLDQFELSEAGGVSWSLCQYAREIEIVGCYALSSVIPCYAA 

GQMQKL 

RG2C polynucleotide sequence (SEQ ID NO:91) 

ATAATATTACACAAAGGTAACGTCATTAATTAATTACGATACGAGACAGA 

CTTTTTCACTCGGACATNAACGGTCTATTCCTAACTTNANNTAATTNAAT 

GAATTTAGGATGTGCTAATATGCATGTAANATTCGCTACCGTCATCTTTC 

AAATGACCATATTTTTATGTATTTATAATGAATCAATGAAAAACCGGATT 

TCTATTTAAAATTCTTAAAACTTCATCTTTTAAGCCAGGGTGAATACAAT 

TGCTAGATCCACTGTTAATTTCCATCGAATTATGCCTGATCAATTGTTGG 

CTGCCTACGATGCAGGTGCTACCACAAGAATATGGCCATGGAAACTGCTA 

ATGAAATTATAAAACAAGTTGTTCCAGTTCTCATGGTTCCTATTAACGAT 

TACCTACGCTACCTCGTTTCCTGCAGAAAGTACATCAGTGACATGGATTT 

GAAAATGAAGGAATTAAAAGAAGCAAAAGACAATGTTGAAGAGCACAAGA 

ATCATAACATTAGTAATCGTCTTGAGGTTCCAGCAGCTCAAGTCCAGAGC 

TGGTTGGAAGATGTAGAAAAGATCAATGCAAAAGTGGAAACTGTTCCTAA 

AGATGTCGGCTGTTGCTTCAATCTAAAGATTAGGTACAGGGCCGGAAGGG 

ATGCCTTCAATATAATTGAGGAGATCGACAGTGTCATGAGACGACACTCT 

CTGATCACTTGGACCGATCATCCCATTCCTTTGGGAAGAGTTGATTCCGT 
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GATGGCATCCACCTCTACGCTTTCAACTGAACACAATGACTTCCAGTCAA 
GAGAGGTAAGGTTTAGTGAAGCACTCAAAGCACTTGAGGCCAACCACATG 
ATAGCCTTATGTGGAATGGGGGGAGTGGGGAAGACCCACATGATGCAAAG 
GCTGAAGAAGGTTGCCAAAGAAAAGAGGAAGTTTGGTTATATCATCGAGG 
5 CGGTTATAGGGGAAATATCGGACCCCATTGCTATTCAGCAAGTTGTAGCA 
GATTACCTATGCATAGAACTGAAAGAAAGCGATAAGAAAACAAGAGCTGA 
GAAGCTTCGTCAAGGGTTCAAGGCCAAATCAGATGGAGGTAACACTAAGT 
TCCTCATAATATTGGATGATGTCTGGCAGTCCGTTGATCTAGAAGATATT 
GGTTTAAGCCCTTCTCCCAATCAAGGTGTCGACTTCAAGGTCTTGTTGAC 

10 TTCACGAGACGAACATGTTTGCTCAGTGATGGGGGTTGAAGCTAATTCAA 
TTATTAACGTGGGACTTCTAATTGAAGCAGAAGCACAAAGATTGTTGCAG 
CAATTTGTAGAAACTTCTGAGCCCGAGCTCCACAAGATAGGAGAAGATAT 
TGTTAGGAGGTGTTGCGGTCTACCCATTGCCATCAAAACCATGGCGTGTA 
CTCTAAGAAATAAAAGAAAGGATGCATGGAAGGATGCACTTTCTCGTTTA 

15 CAACACCATGACATTGGTAATGTTGCTACTGCAGTTTTTAGAACCAGCTA 
TGAGAATCTCCCGGACAAGGAGACAAAATCTGTTTTTTTGATGTGTGGTT 
TGTTTCCCGAAGACTTCAATATTCCTACCGAGGAGTTGATGAGGTATGGA 
TGGGGCTTAAAGTTATTTGATAGAGTTTATACAATTATAGAAGCAAGAAA 
CAGGCTCAACACCTGCATTGACCGACTGGTGCAGACAAATTTACTAATTG 

20 GAAGTGATAATGGTGTACATGTCAAGATGCATGATCTGGTCCGTGCTTTT 
GTTTTGGGTATGTATTCTGAAGTCGAGCAAGCTTCAATTGTCAACCATGG 
TAATATGCCTGGGTGGCCTGATGAAAATGATATGATCGTGCACTCTTGCA 
AAAGAATTTCATTAACATGCAAGGGTATGATTGAGTTTCCAGTAGACCTC 
AAGTTTCCTAAACTAACGATTTTGAAACTTATGCATGGAGATAAATCGCT 

25 AAAGTTTCCTCAAGAATTTTATGAAGGAATGGAAAAGCTCCGGGTTATAT 
CATACCATAAAATGAAGTACCCATTGCTTCCTTTGGCACCTCAATGCTCC 
ACCAACATTCGGGTGCTTCATCTCACGGAATGTTCATTAAAGATGTTTGA 
TTGCTCGTGTATTGGAAATCTATCGAATCTGGAAGTGCTGAGCTTTGCTA 
ATTCTTGCATTGAGTGGTTACCTTCCACGGTCAGAAATTTAAAAAAGCTA 

30 AGGTTACTTGATTTGAGATTGTGTTATGGTCTCCGTATAGAACAGGGTGT 
CITGAAAAGTTTGGTCAAACITGAAGAATTTTATATTGGAAATGCATATG 
GGTTTATAGATGATAACTGCAAGGAGATGGCAGAGCGTTCTTACAACCTT 
TCTGCATTAGAATTCGCGTTCTTTAATAACAAGGCTGAAGTGAAAAATAT 
GTCATTTGAGAATCTTGAACGATTTAAGATCTCAGTGGGATGCTCTTTTG 

35 ATGGAAATATCAATATGAGTAGCCACTCATACGAAAACATGTTGCGATTG 
GTGACCAACAAAGGTGATGTATTAGACTCTAAACTTAATGGGTTATTTTT 
GAAAACAGAGGTGCTTTTTTTAAGTGTGCATGGCATGAATGATCTTGAAG 
ATGTTGAGGTGAAGTCGACACATCCTACTCAGTCCTCTTCATTCTGCAAT 
TTAAAAGTCCTTATTATTTCAAAGTGTGTAGAGTTGAGATACCTTTTCAA 

40 ACTCAATGTTGCAAACACTTTGTCAAGACTTGAGCATCTAGAAGTTTGTA 
AATGCAAGAATATGGAAGAACTCATACATACTGGGATTGGGGGTTGTGGA 
GAAGAGACAATTACTTTCCCCAAGCTGAAGTTTTTATCTTTGAGTCAACT 
ACCGAAGTTATCAGGTTTGTGCCATAATGTCAACATAATTGGGCTACCAC 
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ATCTCGTAGACTTGAAACTTAAGGGCATTCCAGGTrTCACAGTCATTTAT 

CCGCAGAACAAGTTGCGAACATCTAGTTTGTTGAAGGAAGAGGTAGATAT 

ATGTTCTTTATGTTAATACAATTTAAACAATATTTTCAACCAAATTTTCA 

TAATATATCTGTAATTTGATTGTATGATGTGTTATTGTTTATATGTGGCT 

ATTAAGGGATGATAATTTTGCAGGTTGTGATTCCTAAGTTGGAGACACTT 

CAAATTGATGACATGGAGAACTTAGAAGAAATATGGCCTTGTGAACTTAG 

TGGAGGTGAGAAAGTTAAGTTGAGAGAGATTAAAGTGAGTAGCTGTGATA 

AGCTTGTGAATCTATTTCCGCGCAATCCCATGTCTCTGTTGCATCATCTT 

GAAGAGCITACAGTCGAGAATTGCGGTrCCAlTGAGTCGTTATTCAACAT 

TGACTTGGATTGTGTCGGTGCAATTGGAGAAGAAGACAACAAGAGCCTCT 

TAAGAAGCATCAACGTGGAGAATTTAGGGAAGCTAAGAGAGGTGTGGAGG 

ATAAAAGGTGCAGATAACTCTCATCTCATCAATGGTTTTCAAGCTGTTGA 

AAGCATAAAGATTGAAAAATGTAAGAGGTTTAGAAATATATTCACACCTA 

TCACCGCCAATTTTTATCTGGTGGCACTTTTGGAGATTCAGATAGAAGGT 

TGCGGAGGAAATCACGAATCAGAAGAGCAGGTAACGCTTTCAATTTCACT 

TTCTTAATTAATTANGGACTAAGCTCCTGTTTTTTGAATAATAAAGAGGT 

GGGATGACTAAACTTGGGCATCACAATTGCAACAAAATGTTACAAACCAT 

GAAACGCTCAAACCATTTCTTGAATTAAGGTTTCAATACAAGTCATTTAA 

AAATATGGCTTAAATTTTTTTATATTTATGTATCAACATGATTTTTCATT 

AGAGATCATTATTATAATAGTAAGTTTAAAGCAATTTAAATTAGAACTAA 

TTCTAACTTTAGCTAATAAATCGTTATAAATGTAAATAATTACTTTTTAG 

TGAAATAAGCAACGGATTTAATAAGTTAACAACTTAAATGTCATTTCCTA 

ACAAAAAAAACTATTTGGTTCAGAAAAACTGTAATTCAAGATAACTAAAA 

TAAAAATATTTGACATTCACTAAGAGCATTTTTTTCTAAATATGATTGCA 

AATGAATAAAACTTAAATTTATACAGAAAAGATTTTTATATATGTTATAC 

AAA.ATTTACAAATTGAAATTGGATATGTTAATTAACGGTTTATAATTCTG 

GTATCACAAAGGGATATATAATAAAATATTATTTTTCTGTAGTCATTTAT 

AATTGTACTAGTTTATAACCCGTGGGAACCATGAGTTCTAAAATTAGTTA 

AACTTTCATAATAAAAATTTATAATTATTATTTATTTTAAATAAATTATT 

AATrAAGAGATATATCAAAAATTTAAAGTTATTATAACTTCAAATTTAAC 

ATATAATTAAAAAATATATGATCATAACTITCCGCAACTCTTCTTTTGTA 

TTA.AAATGACCAGAGAAGCTCTTAGTATATTTTCTAAATCAAAGTCACAA 

AACTAATGAAGCATATAATTTTGTGAAAATCAATTAGCATTAGGTTTTAA 

GAGTCACCAAATTCAAAGAGTAATCCAATGCTTTCATTACCACTATGGAG 

AAAATATTTTCTTAGTTTAAATGAAATGAAAACAAACATTCAAACTAATT 

GTTGCTTATTAAACCAAAGACCCATTACTTAGCCAAGAGTTTAACCAAAA 

AAAATTACATTCATGTATCATTATTAATGACTAGATATATATGAATATGA 

AGGGAGTTTTTATAGAAAATATAATCATAGATA1TCAACATAACTTCATG 

GAATTCCTCAAAATAACCAAGTTATTCAAGAAATTACATCCAAGTCAACC 

AAAGAGAAGTTTAGCCTAGCATGGCTAAACTCAAGAAAATAAAATAAGGA 

TTAGAAGTACCAAACATGTAGTAAGAATCACAGTAAAAGATGATGTTGTT 

CTTGATGTTCTTCTAAGTTCTTCAAGTCTCCAGTTGCTCCTAATAATGCA 

AAGGAGAGCCATTAAATTCGTATGTATTGATCCCTTCAAAAGCTGCACCA 
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ACCTCCCTTAAATAACACTCAAAGCAAAAATGACAAAATTGCCCCTGAAG 
GACGCTATGCGGGTGCCTTGCGCGGGTGGAGCTGAATATGAAAGGTCTTT 
GGTCTTTGTGAGGGTGATGTTGTGCGGGTTAGCTTGTCGCATGCTTCCGC 
GCGGTTCGCGCACATGTGCACAGGTGATGCATGGTGTGTACGTTCTTGAC 
5 TTTTGAGCCTCCGATGCTTAGTCCACTTGGCCCAATTCGAGTCCAATCAA 
CTTATGACCCATTTTTCTTCAAGTTATCTTCAAGTTAAGCCCAATTTGCC 
TTCTCCAAATCATCCATAACTTCACAGAATCGCCCGTTCATCTTAATCCC 
GAATGAACAATTATTCTCCCGTCTTCATTTTAAGCAAGATACCACCTTCT 
TCATGCTTCATCCATCAATAGTACACTTCATGTATCATCTCTACTAGTTA 

10 TTTAGTCCACAGTCCTTGTTGTCCTCCAAATTTAATTATCTCATTTAGTT 
CCCGTTCCGCTAGTTTCCTTAAAATTTGCAATTAAGCTCACAGAAATATT 
AAGTACCCGAAATGGTCATAAAATAACAGAAAGGAAAATATGCATGAAGA 
TTAACTAAATGATGAACGAAATATGCTAAAATAGACTATAAAATGAAGTA 
AATAAAATGAAATTATCGCACTCCGACCACCCTTATAGGCTTGTAGTCCA 

15 CCCACCCTTCATTCCTTGTACCAATATGGGATGGAAACATCATTAATTAA 
GCCAAAAAACTAACATATAAGGGGTGAGTGACAAAGGTAAGTACTAAAGA 
TGAAAAAAATCCATTTTTCTTGTATATACACAACACACACATAGGGGCAG 
ACGTAGGATTTCATAGTACAGATTGTTGGTGGCACATAAGTGTTGCTAGT 
GACATTTTTTTTTTCTTTTACGTAGTGGCACAACAGTARAAAAAACRAAA 

20 AATTCGAAATTTTTTACAATGTGCCTAAAAAAAACAGTGGTTGTTGGTGC 
CACTATGGACACCAAAGTTGAACTGCCCCTGCGCGCGCACACACACACAC 
ACATAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTTTG 
GGATGTGATACTTCTTTTGGGAAAATGGAGTTATATCTTTGATATTGTAT 
TTTTTTAATGTAATTTATATATTTAATCATTTTAGTTTATAAGTTTTA 

25 TATTTKGATATGAAAAAAAAAGTCTTTTATACATTGGATTTAACATAAAA 
ATCCAACAATATTAATCAAAAAGACCAAACATGTGGACAATTATGTATAT 
AATTAATTCACAATAGTCTTTAGGAATAGNATTATATATATAATTAATTC 
TCA.ATGGTCTTAGGAATAGTAAGTTCTTATATTTCAAACNTTTGCCACAN 
TTCTTTGNTTACTTNGACACTTTYCTCTMWNNANWMWWTWATATATATAT 

30 ATATATATATATAHAHAHAHAVACACACACACTAGATGTGTGCCMGCGCA 
AAGCAGTGACGTNNNGGAGAANACTTTCTTAAGCATAAATAATTATTATA 
TTTTTTATTGGGTATTATATAATAAAAAATTACAACTTTTAAATAAAATA 
TTTATGTTTATACTTTATATTTATATTGCTTGTATACTATTAATATAATA 
AATTAATATTTATGTCTAATTTATGAAATGTAAATTAATTTAAATACATG 

35 AATTTAATATTTTTAAAATTTTCAGTTTGCTTCAAATTGAGTTTCTTAAT 
TATTGACCAAACATGTGGACAATTATGTATATAATTAATTCACAATAGTC 
TTTAGGAATAGTATTATATATATAATTAATTCTCAATGGTCTTAGGAATA 
GTAAGTTCTTATATTTCAAACTTTTGCCACAATTCTTTGCTTACTTTGAC 
ACTTTTCCTTCCTAACTTTACATATATATATATATTAAAGCGCAAAGGTC 

40 ATAGGAATATAATATTTTCTATTATTCTACGTTTTGCCACAAAAGTTTGA 
ACACTTTGCCACTTTTTGTCCCTCCTTAACCTTTTCAATGTTTTGCGACA 
AA^AGXTCCAAAACTTTGCCACTTTGATCATTCCTCAACTTTTCACCGCAT 
TAGTTTGTGGAGTTGGCAGTTTTGGTCCCTCTAACTTCGATATTCTCTAC 
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TGCTAGCCAAAAAGGGTTCCAGAGTTTCACACTTTTGGTCCCTGACAGTA 

ACCAAATGTGAGATGTCAAATTTTTGCCACATTAGTTTGTGGAGTTGTCC 

CTTTTGGTCCCCCCACATTCGATATTCTACTATACGATCnTATTTTTCTC 

AAATAACAACACGTATATTTAATTACTAATGATAGAAATAGATATCAAAT 

AAAGTATTTGTAACACTGTGTAGAGTTTTTTTTTACAAGTTTGTATCAAA 

TCATATCAAAATTTAAGGTGGAACGGTGACCACATTAACCAGAAATGTAA 

TTTATTCTTTGATTTTGATAATTmAATATTTTGTTGTGATCTATGTAT 

TTAAAAGTAAACAACAAAGAACATAATCCAAAACCCTAAATTGCAAGTCT 

CGCCCAATTTCTCTATCACTAGTCCTCACTTACGATGGCGTTACGTCGCT 

CTCTCACTGCTTACAACCCTTTGTTGCTACTCATTACAATAACGAAAAGT 

TGAATATCCATATATTTATTTGGATGTGGAATTGAACGAATCTCGTCAAA 

TTTTTGATTTAGTTGATGGATTTGAGTAGAAGTTTGGGCAGAACGGGAAT 

GATGGTCTGCAAGTGGTTATAAACTTGATTCTGAGTTATTACTATATATG 

TAGCCTCTTTACAACGACCAAGGTTTCTTCCAGGTACCATTTGATCTTTT 

TAG.\ACTTAGTTTTCTGAAACACCCTGATTTGGATCAAATATCACCAACA 

ACTCTTAAAAACTTGATTAATCAATTGTTTACTTCATCTTGATAACAAGT 

GGAATGATTTTCTACTTGAAAAAAAAGGTCCATGTGCGTCTGGTGGATCT 

GGTAAATGAAGATGGAAGGGAGAGCTGACTTTAAAGACACAAACACGTCA 

CCATATCrmATTTTATTTTAAATTTTCTTTTTTCCTATTTC^ 

CTTGATCTCCAGATGGTATGTGGTGTGGATAATTTACACATAGAGATTGG 

GAACGACTGTGATTTAGAGAGGACGTGGCTTGGGGTTGAGGATGGTTTAT 

GGCTGGCCGAGTTTCATTTATATAAACAAACAAATATATAAAACAAGGGG 

TAA,AATGGCCATCTTATATGTATTTAACCGTCCTCTTTTTTTTm 

TTTTTTTTTTTTTTTTTTTGTAATTTAAGAAGGGGTATACCAGTGTCAGC 

CTCTTATTCCCAACCAGTCAAATAGGGACTTAGGTTGTTTGGAAACAGTT 

CCGTGAGACCGTGACTTGGATGGTAGATAAATTTAGTAAACTTAACCCTT 

CAATTAACCTACCTTTTTCTTATTAACTCAATTTCAACCTAAATTCTGAT 

TCTTGTTTGAAAATAAGTTGCATCTTTATTTTTGTATTATCTTGTTGCAT 

AGGATCCTTAGCATCTTTTAATAATTTATTTGAAGGTGAAAGATCCAACT 

ATTTTTAATCTGTTGACGTTTTCCATCATTTGCAACTGrrrCTTGAAAAA 

AAAATACCTAAAATCAAAATAACCATTTTCAAATCCAAAATTATAAGAGA 

GAATTGTAAATGGACATGGAATCATAAATCATTAACACAGTTCAGTAAAC 

AAGTTGCTAATTACATTTCTTGCTGTGCAGATTGAAATTCTATCAGAGAA 

AGAGACATTACAAGAAGCCACTGGCAGTATTTCAAATCTTGTATTCCCAT 

CCTGTCTCATGCACTCTTTTCATAACCTCCGTGTGCTTACATTGGATAAT 

TATGAAGGAGTGGAGGTGGTGTTTGAGATAGAGAGTGAGAGTCCAACAAG 

TAGAGAATTGGTAACAACTCACAATAACCAACAACAGCCTATTATACTTC 

CCTACCTCCAGGAATTGTATCTAAGGAATATGGACAACACGAGTCATGTG 

TGG.-^AGTGCAGCAACTGGAATAAATTCTTCACTCTTCCAAAACAACAATC 

AGAATCACCATTCCACAACCTCACAACCATAGAAATGAGATGGTGTCATG 

GCTTTAGGTACTTGTTTTCGCCTCTCATGGCAGAACTTCTTTCCAACCTA 

AAGAAAGTCAAGATACTTGGGTGTGATGGTATTAAAGAAGTTGTTTCAAA 

CAGAGATGATGAGGATGAAGAAATGACTACATTTACATCTACCCACAAAA 
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CCACCAACTTGTTCCCTCATCTTGATTCTCTCACTCTAAACCAACTGAAG 
AATCTGAAGTGTATTGGTGGAGGTGGTGCCAAGGATGAGGGGAGCAATGA 
AATATCTTTCAATAATACCACTGCAACGACTGCTGTTOTGATCAATTTG 
AGGTATGCTTTGTACATATTCAATTATTTATTTAATTTCCTTTTTTAm 
5 GCAATATTCTATAAATAATACATTTTATACCCACTATACTAAGATAATAA 
TTACCTAGAGGGATGGATGCTATGACACAGCTGCTACACTTCAGAAACTC 
TAGTAAGGGCAGTTATGGAAGTTCAATAAAATGATAATGGCATCTTTTGA 
TGGGTAATATAGGCAATTTAAGTTTTATTTCTGTTAAAGCAGTATTTAGC 
AAGTACTGGCCAGTAGGAGAGGAGAATATCACCTTTTGTGAAAATCTGGT 

10 CATTGTACCCAGAATTTAGTTAAATGTAACATTTTAGATATTAGGGGTTA 
TCAGGTGACAGATATTGTAGAATAGAACAATATGTAATATTACCCAAAAC 
TATTTTTTCTAAGGTTGCTCTGTTAAATATGTGCTITCnTGATTTC^ 
AATTTGCATTCCTATATTTTAGGTGGTAAAGTGATTGTCTCTTCAATAAA 
TCCCGAAATTAATTAAAAAAAAAAAAACAAAAGTAAATTTTTGATATGGA 

15 GAGCACTGGTATCATTTAGTATATAAAAAAACTAGATTTTGAATTAAGTT 
TCTTATATAAAAGCTGTGTATATAGTTTAATTAGTTTTACATCATTTTTC 
CATGTGGTGTTGCAGTTGTCTGAAGCAGGTGGTGTTTCTTGGAGCTTATG 
CCA.\TACGCTAGAGAGATAAAAATAGGCAACTGCCATGCATTGTCAAGTG 
TGATTCCATGTTATGCAGCAGGACAAATGCAAAAGCTTCAAGTGCTGAGA 

20 GTA.ATGGCTTGCAATGGGATGAAGGAGGTATTTGAAACTCAATTAGGGAC 
GAGCAGCAACAAAAACAACGAGAAGAGTGGTTGTGAGGAAGGAATTCCAA 
GAGTAAATAACAATGTTATTATGCTTCCCAATCTAAAGATATTAAGTATT 
GGA-\ATTGTGGGGGTTTGGAACATATATTCACATTCTCTGCACTTGAAAG 
CCTGAGACAGCTCCAAGAGTTAACGATTAAGGGTTGCTACAGAATGAAAG 

25 TGATTGTGAAGAAGGAAGAAGATGAATATGGAGAGCAGCAAACAACAACA 
ACA.ACAACGAAGGGGGCATCTTCTTCTTCTTCTTCTTCTAAGAAGGTGGT 
GGTCTTTCCTTGTCTAAAGTCCATTGTATTGGTCAATCTACCAGAGCTGG 
TAGGATTCTTCTTGGGGATGAATGAGTTCCGGTTGCCTTCATTAGATAAA 
CTTATCATCGAGAAATGCCCAAAAATGATGGTGTTTACAGCTGGTGGGTC 

30 CACAGCTCCCCAACTCAAGTATATACACACAAGATTAGGCAAACATACTC 
TTGATCAAGAATCTGGCCTTAACTTTCATCAGGTACATATATATTCCTTT 
AATTGGCATCATCTAATTAAGAAAGATATCATTCCTGCCAAGTAAATTTA 
CTTCAAACACATTCACACTAGTTTCAGTCCAAGTTTATGTTGTTCTAGGA 
AGGCCAAAATGGGAAAGCAAGATAGGGAAAAATAGAGTATTTCAGTGGAA 

35 AGGGTATTTTAGGTATTTTCTGTCAAAAATTGTTATTGCAGGCTIT^ 

TACCTGGAAGAGCATGATTATTCTCGATTTGCTTGTTTCTTTATCATTTT 
TCTTAGCCTAGCATGATTTTCAATGAAATCTTTCCCTGTTACTCCATTTG 
ATTGTTGTTCTTCATGGTTCTAAGTGAGTTAGTGGCTCATCTGTTACTTC 
TTTTGATTGTTATTTTCATAGCATGTTGTCACTTGAATCAAGCTTTTCCA 

40 TTTTCAACAAGGACAAAAGGTCAAAACTAACCTACTTTATGAGATCAAAA 
ATAGCAACCCATCGGATAACTTTTAGTTGGAGTTAATACTTACAATTACC 
ATTGTGATTAATAATTATAATATCTTGTATTAATTCATAAAAATTGGTAC 
AGCACATATATGACATTTCAAAGGTTTTTGTTTGACATATATATGCCTCT 
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GGCGTTTTCTTTATTGGACATGCAGACTTCATTCCAAAGTTTATACGGTG 
ACACCTTGGGCCCTGCTACTTCAGAAGGGACAACTTGGTCTTTTCATAAC 
TTTATCGAATTAGATGTGAAATCTAATCATGATGTTAAAAAGATTATTCC 
ATCCAGTGAGTTGCTGCAACTGCAAAAGCTGGTAAAGATTAATGTAATGT 

5 GGTGTAAAAGGGTAGAGGAGGTATTTGAAACTGCATTGGAAGCAGCAGGG 
AGAAATGGAAATAGTGGAATTGGTTTTGATGAATCGTCACAAACAACTAC 
CACTACTCTTGTCAATCTTCCAAACCTTGGAGAAATGAAGTTACGGGGTC 
TCGATTGTCTGAGGTATATATGGAAGAGCAATCAGTGGACAGCATTTGAG 
TTTCCAAACCTAACAAGAGTTGAAATTTATGAATGTAATTCATTAGAACA 

10 TGTATTTACTAGTTCCATGGTTGGTAGTCTATTGCAACTCCAAGAGCTAG 
AGATTGGTTTGTGCAACCATATGGAGGTCGTGCATGTTCAGGATGCAGAT 
GTTTCTGTAGAAGAAGACAAAGAGAAAGAATCTGATGGCAAGATGAATAA 
GGAGATACTTGTGTTACCTCATCTAAAGTCATTGAAATTACTACTTCTTC 
AAAGTCTTAAGGGGTTTAGCTTGGGGAAGGAGGATTTTTCATTCCCATTA 

15 TTGGATACTTTGGAAATCTACGAATGCCCAGCAATAACCACCTTCACCAA 
GGG.\AATTCCGCTACTCCACAGCTAAAAGAAATGGAAACAAATTTTGGCT 
TCTTTTATGCTGCAGGGGAAAAAGACATCAACTCCTCTATTATAAAGATC 
AAACAACAGGTAAACCAGATCTTTGTTGCTTTAATAATTCTTAAACTACA 
TTTGAAAAGCTTCATGCAAGTTTTTTTTGTTATATTGTCAAAAACCGCAA 

20 CCTACATTTTCAGCTTTATATTTATGTACTTTATGCAGGATTTCAAACAA 
GACTCTGATTAATGTGAAGTGAATATTAAAGGTAAATTATATTTTCATGT 
TCCTAGTTGCCTATTAATTAAAGGCCTTTTAGTTCGTGATTTTTGGATGT 
ATTCTTCATGATGATGTCAATCTTCTAATACCCCATTCATTGTTTGGTTG 
AATGTTGACTCTATGTCAGGATGAATATTCAAGGGAAGAATTGTTCATCA 

25 TATGAAGGACATTAAAGAACATGGTGCTAT 

RG2C deduced polypeptide sequence (SEQ ID NO:92) 

MA^IETANEIIKQVWVI^WmDYIJlYLVSCRKYISDMDIJCMKELKEAKDNVEEH 
KimNISNM^VPAAQVQSWLEDVmNAKVETVPKDVGCCFNLKIRYRAGRDAFNI 

30 lEEIDSVMRRHSLITWTDHPIPLGRVDSVMASTSTLSTEHNDFQSREVRFSEALKALE 
ANHMIALCGMGGVGKTHMMQRLKKVAKEKRKFGYIIEAVIGEISDPIAIQQVVADY 
LCIELKESDKKTRAEKLRQGFKAKSDGGNTKFLIILDDVWQSVDLEDIGLSPSPNQG 
VDFKVLLTSRDEHVCSVMGVEANSnNVGLUEAEAQRLFQQFVETSEPELHKIGEDI 
VRRCCGmAIKTMACTLRNKRKDAWKDALSRLQHHDIGNVATAVFRTSYENLPD 

35 KETKSVFmCGLFPEDFNIPTEELMRYGWGLKLFDRVYTnEARNRLNTCIDRLVQT 
NLUGSDNGVHVKMHDLVRAFVLGMYSEVEQASIVNHGNMPGWPDENDMIVHSC 
mSLTCKGMffiFPVDUOTKLTIIJajkfflGDKSIiaTQEFiEGMEKUlVIS^^ 
PLLPLAPQCST^^RVU^LTECSLKMFDCSCIG^fLSNLEVLSFANSCIEWLPSTVRNLK 
KLRLLDLRLCYGLRIEQGVLKSLVKLEEFYIGNAYGFIDDNCKEMAERSYNLSALEF 

40 AFFNNKAEVKNMSFENLERFKISVGCSFDGNINMSSHSYENMLRLVTNKGDVLDSK 
LNGU'IJCTEVIJ'IJSVHGMNDLEDVEVKSTHPTQSSSFCI^KVUISKCVEUIYLFKL 
NVANTLSRLEHLEVCKCKNMEELIHTGIGGCGEEnTFPKLKFLSLSQLPKLSGLCH 
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NVNmGLPHLVDLKLKGIPGFTVIYPQNKLRTSSLLKEEVVIPKLETLQIDDMENLEEI. 
WPCEI^GGEKVKLREKVSSCDKLVNLFPRNPMSLLHHLEELTVENCGSffiSLFNID 
UjCVGAIGEEDNKSLLRSINVENLGKLREVWRIKGADNSHUNGFQAVESIKIEKCK 
RFR2sTFTPrrANFYLVAIXOQIEGCGGNHESEEQIEIi;SEKEriXJEATGSISM.V^ 
5 LMHSFHNLRVLTLDNYEGVEVWEIESESI^SRELVTTHNNQQQPnLPYLQELYLR 
NMDNTSHVWKCSNW^^CFFTU»KQQSESPFHNLTTIEMRWCHGFRYLFSPLMAELL. 
SNLKKVKILGCDGIKEWSNRDDEDEEMTTFTSTHKTTNLFPHIJ)SLTIJ<QLKNLK 
CIGGGGAKDEGSNEISFNNTTATTAVLDQFELSEAGGVSWSLCQYAREIKIGNCHAL 
SSVIPCYAAGQMQKLQVLRVMACNGMKEVFETQLGTSSNKNNEKSGCEEGIPRVN 

10 NhTVTMLPNmi^IGNCGGLEfflFTFSALESLRQLQELmGCYRMKVIVKKEEDEYG 
EQQTTTTTTKGASSSSSSSKKVVVFPCIJCSIVLVNLPELVGFFLGMNEFRLPSLDKLn 
EKCPKMMVFTAGGSTAPQLKYIHTRLGKHTLDQESGL^fFHQTSFQSLYGDTLGPAT 
SEGTTWSFHNFffiIJ)VKSraDVKKIIPSSEIJ:X3LQKLVKINVMWCKR 
AAGRNGNSGIGFDESSQTTTTTLVNIJ>NmEMKIJlGLDCLRYrWKSNQWTAFEFPN 

15 LTR\^IYECNSLEHVFTSSMVGSLLQLQELEIGLCNHMEVVHVQDADVSVEEDKEK 
ESDGKMNKEILVU'HUCSUa.IXLQSUCGFSLGKEDFSFPIiDTLEIYECPAITTFTK 
GNSATPQLKEMETNFGFFYAAGEKDINSSnKIKQQDFKQDSD. 

R62D polynucleotide sequence (SEQ ID NO:93) and (SEQ ID NO:94) 

20 ACGACCACTATAGGGCGAATTGGGCCCGACGTCGCATGCTCCCGGCCGCC 
ATGGCCGCGGGATGTAAAACGACGGCCAGTCGAATCGTAACCGTTCGTAC 
GAGAATCGCTGTCCTCTCCTTCAACCATTTAATGTATATGAGCTAAATTG 
AAACATCTACTATCATGITTAAATTTATAAACTTTTrCCTTTAGATTCAC 
TTGTCTGGATGTGTTTAATAAAACCCAATTTCCCACATGCGTAGAGATCA 

25 TAGATGTAACTATTGTTAATCAATTTTGCCTGCCAAGTTTTAATAATTAT 
ACTTGGATATTAACAAAACTTTATCTAACGACCAAGGTAATATTAAAAAT 
AGGTTATTATTCTTCATGCTAATTAAAAGATGGGTTGCAAAAGTGAGACC 
ATG.^AAACATTAACACGTTGATATTTTCAACTTTTATTCTTTCATATTCA 
CCATATTTTTTACTTTCGTATTGATTAATCATCTTTCAATCACAGGCTCC 

30 TTGGCAAAAAGTCAGATCTATTAACAAATACTTCCATGTGGTTGCAAATT 
ACAAGGATTTCAACATAATTACCAAAACATAGCATTATCATAAGATCGAA 
TAATAATCAAATTCTTCTATAATATTACACAAAGGTAACGTCATTAATTA 
ATTACGATACGAGACAGACTTTTTCACTCGTGACATCAACGGTCTATTCT 
AACTTTACTTAATTAAATGAATCTAGGATGTGCTCATATGCATGTAATAT 

35 TTGCTACCGTCATCTTTCAAATGACCATATTTTTATGTATTTATAATGAA 
TCAATGAAAAACCGGATTTCTATTTAAAATTCITAAAACTTCATCrTTTA 
AGCCAGGGTGAATACAATTGTAGATCCACTGTTAATTTCCATCGATTATG 
CGTGATCAATTGTTGGCTGCATACGATGCAGGTGCTACCACAAGAATATG 
GCCATGGAAACTGCTAATGAAATTATAAAACAAGTTGTTCCAGTTCTCAT 

40 GGTTCCTATTAACGATTACCTACGCTACGTCGTTTCCTGCAGAAAGTACA 
TCAGTGACATGGATTTGAAAATGAAGGAATTAAAAGAAGCAAAAGACAAT 
GTTGAAGAGCACAAGAATCATAACATTAGTAATCGTCTTGAGGTTCCAGC 
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AGCTCAAGTCCAGAGCTGGTTGGAAGATGTAGAAAAGATCAATGCAAAAG 

TGGAAACTGTTCCTAAAGATGTCGGCTGTTGCTTCAATCTAAAGATTAGG 

TACAGGGCCGGAAGGGATGCCTTCAATATAATTGAGGAGATCGACAGTGT 

CATGAGACGACACTCTCTGATCACTTGGACCGATCATCCCATTCCTTTGG 

GAAGAGTTGATTCCGTGATGGCATCCACCTCTACGCTTTCAACTGAACAC 

AATGACTTCCAGTCAAGAGAGGTAAGGTTTAGTGAAGCACTCAAAGCACT 

TGAGGCCAACCACATGATAGCATTATGTGGAATGGGGAGAGTGGGGAAGA 

CCCACATGATGCAAAGGCTGAAGAAGGTTGCCAAAGAAAAGAGGAAGTTT 

GGTTATATCATCGAGGCAGTTATAGGGGAAATATCGGACCCCATTGCTAT 

TCAGCAAGTTGTAGCAGATTACCTATGCATAGAGCTGAAAGAAAGCGATA 

AGAAAACAAGAGCTGAGAAGCTTCGTCAAGGGTTCAAGGCCAAATCAGAT 

GGAGGTAACACTAAGTTCCTCATAATATTGGATGATGTCTGGCAGTCCGT 

TGATCTAGAAGATATTGGTTTAAGCCCrrCTCCCAATCAAGGTGTCGACT 

TCAAGGTCTTGTTGACTTCACGAGACGAACATGTTTGCTCAGTGATGGGG 

GTTGAAGCTAATTCAATTATTAACGTGGGACTTCTAATTGAAGCAGAAGC 

ACA-\AGATTGTTCCAGCAATTTGTAGAAACTTCTGAGCCCGAGCTCCACA 

AGATAGGAGAAGATATTGTTAGGAGGTGTTGCGGTCTACCCATTGCCATC 

AAAACCATGGCGTGTACTCTAAGAAATAAAAGAAAGGATGCATGGAAGGA 

TGCACTTTCTCGTTTACAACACCATGACATTGGTAATGTTGCTACTGCAG 

TTTTTAGAACCAGCTATGAGAATCTCCCGGACAAGGAGACAAAATCTGTT 

TTTTTGATGTGTGGTTTGTTTCCCGAAGACTTCAATATTCCTACCGAGGA 

GTTGATGAGGTATGGATGGGGCTTAAAGTTATTTGATAGAGTTTATACAA 

TTATAGAAGCAAGAAACAGGCTCAACACCTGCATTGAGCGACTGGTGCAG 

GCAAATTTACTAATTGGAAGTGATAATGGTGTACACGTCAAGATGCATGA 

TCTGGTCCGTGCTTTTGTTTTGGGTATGTATTCTGAAGTCGAGCAAGCTT 

CAATTGTCAACCATGGTAATATGCCTGGGTGGCCTGATGAAAATGATATG 

ATCGTGCACTCTTGCAAAAGAATTTCATTAACATGCAAGGGTATGATTGA 

GATTCCAGTAGACCTCAAGTTTCCTAAACTAA CGAT TTTGAAACTTATGC 

ATGGAGATAAGTCTCTAAAGTTTCCTCAAGAATnTATGAAGGAATGGAA 

AAGCTCCAGGTTATATCATACGATAAAATGAAGTACCCATTGCTTCCTTT 

GGCACCTCAATGCTCCACCAACATTCGGGTGCTTCATCTCACTGAATGTT 

CATTAAAGATGnTGATTGCTCTTCTATCGGAAATCTATCGAATCTGGAA 

GTGCTGAGCTTTGCTAATTCTCGCATTGAATGGTTACCTTCCACAGTCAG 

AAATTTAAAGAAGCTAAGGTTACTTGATCTGAGATTTTGTGATGGTCTCC 

GTATAGAACAGGGTGTCTTGAAAAGTTTGGTCAAACTTGAAGAATTTTAT 

ATTGGAAATGCATATGGGTTTATAGATGATAACTGCAAGGACATGGCAGA 

GCGTTCTTACAACCTTTCTGCATTAGAATTCGCGTTCTTTAATAACAAGG 

CTGAAGTGAAAAATATGTCATTTGAGAATCTTGAACGATTCAAGATCTCA 

GTGGGGTGCTCTTTTGATGGAAATATCAGTATGAGTAGCCACTCATACGA 

AAACATGTTGCAATTGGTGACCAACAAAGGTGATGTATTAGACTCTAAAC 

TTA-\TGGGTTATTTTTGAAAACAGAGGTGCTTTTTTTAAGTGTGCATGGC 

ATG.AATGATCTTGAAGATGTTGAGGTGAAGTCGACACATCCTACTCAGTC 

CTCTTCATTCTGCAATTTAAAAGTCCGTATTATTTCAAAGTGTGTAGAGT 
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TGAGATACCTTTTCAAACTCCATGTTGCAAACACTTTGTCAAGCCTTGAG 
CATCTAGAAGTTTGTGGATGCGAAAATATGGAAGAACTCATACATACTGG 
GATTGGGGGTTGTGGAGAAGAGACAATTACTTTCCCCAAGCTGAAGTCTT 
TATCTTTGAGTCAACTACCGAAGTTATCAGGTTTGTGCCATAATGTCAAC 
5 ATA.ATTGGGCTACCACATCTCGTAGACTTGAAACTTAAGGGCATTCCAGG 
TTTCACAGTCATTTATCCGCAGAACAAGTTGCGAACATCTAGTTTGTTGA 
AGGAAGAGGTAGATATATGTTCTTTATGTTAATACAATTTAAATAATATT 
TTC.\ACCAAAATTTCATAATATATCTGTAATrrGATTGTATGATGTGTTA 
TTGTTTATATGTGGCTATTAAGGGATGATTATTTTGCAGGTTGTGATTCC 

10 TAAGTTGGAGACACTTCAAATTGATGGCATGGAGAACTTAGAAGAAATAT 
GGCCTTGTGAGCTTAGTGGAGGTGAGAAAGTTAAGTTGAGAGAGATTAAA 
GTGAGTAGCTGTGATAAGCTTGTGAATCTATTTCCGCACAATCCCATGTC 
TCTGTTGCATCATCTTGAAGAGCTTAAAGTCAAAAATTGTCGTTCCATTG 
AGTCGTTATTCAACATCGACTTGGATTGTGTCAGTGCAATTGGAGAAGAA 

15 GAC.AACAAGAGCATCTTAAGAAGAATCAAAGTGAAGAATTTAGGGAAGCT 
AAGAGAGGTGTGGAGGATAAAAGGTGCAGATAACTCTCGTCCCCTCATCC 
ATGGCTTTCCAGCTGTTGAAAGCATAAGTATCTGGGGATGTAAGCGGTTT 
AGAAATATATTCACACCTATCACCGCCAATTTTGATCTGGTGGCACTTTT 
GGAGATTCACATAGGAAATTACAGAGAAAATCATGAATCGGAAGAGCAGG 

20 TAACGCTTTCAATTTCACTTTCTTACTTAATTAAGGACTAAGCTCTTGTT 
TTTTGAATAATAAAGAGGTGGGATGACTAAACTTGGGCATCACAATTGTA 
ACA-\AATGTTACAAACCATGAACGTACAAACCATTTCTTGAATTAAGGTT 
TCAATACAAGTCATTTACAAATATGGCTTAAGTTTTTTTATATTTATGTA 
TCAACATTATTTTTCATTAGAGGTCATTATTATAATAGTAAGTTTAAAGC 

25 AATTTAAATTAGCACTAATTTTTCATCATCTAACTTTAGCTAATAAATCG 
TTATAAATGTCAATAGCTAAAATAAAAATATTTGACATTCACTGAGAGCA 
ATTTTTTCTAAACATGATTGCAAATGATTAAAACTTAAATTTAAACTAAA 
AAGATTTTTATATATGTTATACAAAATTTACAAATTGAAATTGGATATGT 
TAATTAACAGTTTATAATTATTGTATTACAAAGCGATATATAATAAAATA 

30 TTATTTTTCTGTAGTCATGTATAATTGTATATGTAAATGATTTTTTAAGA 
TGGTAGAAGTGGAAACTAGTCAATCTCACTTAACTCATTGTCACACCAGT 
TTTATATCCGTTTCTCTCTCrCTCTCTTCTTGCCTCCATCTTTTTTCAAC 
TCATAACACATAAAAATAACATATTTTCCAACACATTTAAGTCACTACCA 
CATCATTATTTTTAATTTAATTAAATTAGAAAATATAAAATTAAATAAAA 

35 CAT.^ACATTTTTTTATTAAAAGGCACTAATACAAATAAAAAGATACACGG 
TAA.\TAAAAAAACGATAATTAGAAAAAAAACATAATAAAAAAAGACAACA 
TTA.\AAATAWAAAGCGACAACTAAAATTAACTAATGATCAAGAAAATTCT 
AAAACTCCCACCATATTTTTCTGCAATTTGTCATTTATGTTCAAACACCA 
TTCGCAGAATCCCTCCTATCAAGTGATCATGTTGATTGAGAAAAAACTGT 

40 ATGTCrCTCTCATGTATCTCCAAGTCCAACAAGTTAGCTTTCATTTCTTC 
ATTTTCTCATGTAAGACGCAAATTTTCATCCCGATATTGTTTTCTATCTT 
CCACCTCTACTTTATTCACAGTGTGGATGAAGGAGAGGACAGCGATTCTC 
GTACGAACGGTTACGATTCGACTGGCCGTCGTTTTACAATCCCGCGGCCA 
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TGGCGGCCGGGAGCATGCGACGTCGGGCCCATTCGCCCTATAGTGGTCGT 
AATACA (SEQIDNO:93) 

Sequence gap 

TGAGCCTCCGATGCITAGTCCACTTGGCACAGTTCAAGTCCAATCAACTT 

5 ATAACCCATTTTTCTTCAAGTTGTCTTCAAGTTAAGCCCAATTTGCCTTC 
TCCAAATCATCCATAACTTCATGGAATCGCCCCTTCATCTTAATCCCGAA 
TGCACAATTATTCTCCCATCTTCATTTTAAGCAAGAGGCCACCTTCTTCA 
TGCTTCATCCATCAATAGTCTGTTGGAATAGTGTCTAAGGCTGCAACTAT 
ATTAGACAAGTATTTGACCCGGTTGTGCATGGTCCITTTGGGTTGCCTTC 

10 ACCATAGCAACTTGATAGGATGATTTATTAAGAGAGAGTAAATATTATTA 
ATATATTATGAGAATAATATAATGAATAATATATTTGTTATTTGATTAAT 
ATAAGTCATAGAATTAATTAGAATTAATTTGGTGACTTAAAGAGATTAAT 
TAAATAAAGGGGTATAAACTGTCAATTGTTTGATAGTTAAGCTTTAGACT 
GTAAATCCATTTGGATATGGTATGGACGAATeCTAAGGGATTTAGGATAG 

15 CTA.^AATCGTCCATATGAGTTATCTAAGAAGGATTTGGATAGCCTTAAGA 
GAAGATTATCTGATAGGGACTTATCTGTAATCCTTAAGGAGTCTACAAGT 
ATA.\ATAGACCCTATGGCTGATGGAATTCGACACATCTCCTAAAGTAAGA 
GAGCCTTGGCCGAATTCCTCCCCTCACCTCTCTCCTAAATCATTCTTCTT 
GCTATTGGTGTTTGTAAGCCATTAGAGGAGTGACATTTGTGACTCTAGAA 

20 TCTCCAAGACCTCAAGATCAACAAGGAATTCAAAGGTATGATTCTAGATC 
TGTTTCAATGTTGTTATTTGTCCTAATTAGTCATTAGAAGACTTGGATTC 
AAAGCATGTTTATTAGAAAGCCTAGATCYGAGCAATAGGGTTTTGCATGC 
GCACATAGGAAAGTTCTTATGGCTAAAACCCATCATAGTCCACTTCATGT 
ATCATCTCTACTAGTTATTTAGTCCATAATCCTTGTTGTCCTCCAAGTTT 

25 AATTACCTCCCTTAGTTCCTGTTCTGCTAGTTTCCTTAAAATTTGCTATT 

AAGATCACAGAACTAGAGAGTACCCAAAATGGTTATAAAATAACAAAAAG 
GAA.AATATGCATGAAGATTAACTAAATTATAAATGTAATATGCTAAAATA 
AACTATAAAAAAAAAGTAAATAAAATGAAACTATCACACTCCGACCACCC 
TTATAGGCTTGTACTGCACCCACCCTTCATTCCTTGTACCAATATGGGAT 

30 GGAAACATTATTCATTAAGCCAAAAAACTAACATTTAAGGGGTGAGTGAC 
AAAGGTAAGTACTAAAGACAACAATAATCCATTTTTCTTGTACATACACA 
ACACACACATAGGGGCGGACGTAGGATTTGTAGTATGTGTTGTGGGTGAC 
ACATTTTTTCTTTTACGTAGTGACACAATAGTAGAGAAAACGAGAAATTC 
CAATTTTTTACATTGTGTTCGAAAAAATATACAGGGGTTGCTGGTGCTAC 

35 TCTGGGCACCAAAGTGGAACCGCCCCTGCACACACACACACATAGAGGGA 
GAGAGAGAGGAGAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATTT 
TGGGATGTGATACTTCTTTTGGGAAAATGGAGCAATATCTTTAATATTGT 
ATTTTTTTAATGTAATTTATATATTTAATCATTTTAGTTTATAACm^ 
GrrirrriTATTTTAATCTGTATATTTAATCATTTCAGTTTATAAGTTTT 

40 ATTTATTTTGGTATACCAGAAAAAAAAGTCTTTTATGTGTTGGATTTAAC 
ATA.AAAATCTAACAATATTAATCAAAAAGACCAAACATGTGGACAATTAT 
GTATATAATTAATTCTCAATGGTCTTAGTGTAACGATATAAATTTCAAAA 
CAATTTTTCACATTAAAAAAAACACTTTCAGTCATAATTGTTATAAATTA 
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TCATTGTATCACAAAATCAGTTCATAACATCACATCCCAAGATCAATAAA 
GTGTAAATACTCCTCATGTGTGTACTAATCAAGCCGACGCCTTCCCGCGA 
TTCTCACTGGTACCTGAAACACGTAACATAACAACTGTAAGCATAAATGC 
TTAGTGAGTTCCCCAAAATACCACATACCACATATATGCCTTTCCAGGCC 

5 ATA,\CTCTGTAGGATCTTCCGACCCAAGTGTCTCAGGGGACTTCCGTCCC 
GAATCCCGGTAGACCTTCCGGTCCTACCCGTATTGACCTTCCGGTCCGTA 
TCATACATAACATACATAACACATACATATCACATAACAACATATAGCAC 
ATACATCTCATAACATAAAAGACCTTCCGGTCACATAAAGGTACCCTTCC 
AGGTACAGTATAGTGAGAANACTCACCTCGTATGATGTCTAATACCTCAC 

10 GTGCTCGATATCCCTGAATCTCGAAACAATGACCTAGCCCCGCCTACTCA 
CATAAAGTAATTATTTCAAATCATTAACGGCTCTCAAGGCTAGACTACAT 
CCCTTTCTATAAATCCACAGAAGGGTAAAAGACCATTTTACCCCTCCTTG 
ACCCAAAAGTCCAAATGTTGATCAAAACCCCAAAAGTCAACGAAAGACAA 
TGGTCAACTTTGACCCTACTCGTGGAGTGCACAAAGGTGACTCGGCAAGT 

15 ACATGCGGGTCCTCTGAATCCTTTCAGTCTCTCTTGGCTCGTCGAGTCTT 
TCTTCCACCCGACGAGTTACACCTGTCATGAATCGCGGGGCAACCCCGAC 
TCGACTTGTCGAGTCCGCTCATGGACTCAACGAGTTCATTCCATGCTCAC 
ACTCAAATGACCTCCTGAGGTCAGATCTGTTCCTCTAATCCATAGATCTG 
ACCTTCCCAAGCTCAATAAACACGTAAAGGTTCGAACTTGATACTCATGC 

20 AACGTCCAAATGATTCTACTTGATGATTTAGCCCCAAATACAACATCCTA 
AGTCCATACGACCTTATTTTTCTCAAATAACAACACATATATTTAATTAC 
CAATGACAGTAATAGATATCATATAAAGTATTTGTAACACTTTGTAAGAA 
CCTTGCTACTATAGGTAAAAAGAAACATTTCAAAGTACATGCCCTAATTA 
GAA,\AAAAGTTATAAAAAAATAATGACTAGGGGCGTGTTTTTTTTACTAG 

25 TTTGTATCAAATTATATCAAAATTTAAGGTGGAAAAGAATGACGACCACA 
TTAACCAGAAATGTAATTATTTTTTTATTTGGTAATTTTTAATATTTGTT 
GTGATCTATGTATTTAAAAGTAAATATCAAACAAGAACATAATCCAAACC 
CTA.^TTGCAAGTCTCGCCCAATTTCTCTATCACTAGTCCTCACTTACGA 
TGGCGTTACGTCGCTCTCTCACTTCCTACAACCCATTGTTGCTACTAATT 

30 ACACTAACGAAAAGTTGAATATCCATATATTTATTTGGATGTGAAATTGA 
ACG.AATCTCGTCAAATTTTTTATTTTGTTGATGGATTTGAGTGGAAGTTT 
AGGCAGAACGGGAATGATGGTCTGCAAGTGGTTATAAACATGGGTGAAGA 
TAAAATGGAGTTGTCGCCGTTGTATTATAGATCTCTTAGGGGTTTGATTC 
TGAGTTATTACTGTATACGTAGCCTCTTTACAACGACCATTCTTCCAAGT 

35 ACCATTTGATCTTTTTAGAATCCAGTTGTCTGAAACACCCTGATTTGGAT 
CAAATATCACCAACAACTCTTAAGAACTGGACTAATTAATTGTTTTCTTG 
ATCTTGATAACAAGAGGAAACACGTCACCATATCTTTTATTTTAAATTTG 
CTTTTGGTGTATTTTCTTTCTTCCCATTTCTTTCTTGATCTGTTCCAGAT 
GGTATTTGGTGTGGATAATTTACACCTGGAGATTGTGAACGATGGGAAGG 

40 GGTATGTGATTTACAGAGGATGTGGCTTGTGGTTGAGGATGGTTTATGGC 
TGGCCGAGTCTAATTTATATTTATATAAACAAATAAATATATAAAACAAG 
GGTAAAATATGTATTTAAGCGTCCTCTTTTAATGGTGACAATTTTTACAG 
TTTACTCTCTTTGTTTTTTAATTGTGATGCCCACGATCGAACTCATTCAT 
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CCCCCCCCCTTTTTTTTTTAAAATAAAAAATTAAGAAGGGGTACCACCAT 
ATACCCGTGTCAGCTTCTTATTCCCAAGCAGTCAAATAGGGACTTAGGTT 
GTATGGAAACAGTTCCGTGACTTGGATGGCAGATAAATTTAGTAAACTTA 
ACCCTTCAATTAACCTACCnrmOTATTAACTCAATTTCAAGCTAAAT 

5 TCTGATTCTTGTTTGAAAATAAGTTGCATCTTTATTTTTGCATATTATCT 
TGTTGCATAGGATCCTTAGCATCTTTTAATAGTTTATTTGAAGCTGAAAG 
ATCCAACTAGTTTTGATCTGTTGGCATTTTCCATCATTTGCAACTGTTTC 
TTGAAAAAAAATACCTAAAATCAAAATAACCATTTTCAAATCCAAAATTA 
TAAGAGAGAATTGTTAATGGACGTGGAATCATAAATCATTAACACAGTTC 

10 AGTACACAAGTTGCTAATTACATTTCTTGCTGTGCAGATTGAAATTCTAT 
CAGAGAAAGAGACATTACAAGAAGTCACTGATACTAATATTTCTAATGAT 
GTTGTATTATTCCCATCCTGTCTCATGCACTCTTTTCATAACGTCCATAA 
ACTTAAATTGGAAAATTATGAAGGAGTGGAGGTGGTGTTTGAGATAGAGA 
GTGAGAGTCCAACATGTAGAGAATTGGTAACAACTCACAATAACCAACAA 

15 CAGCCTATTATACTTCCCAACCTCCAGGAATTGTATCTAAGGAATATGGA 
CAACACGAGTCATGTGTGGAAGTGCAGCAACTGGAATAAATTCTTCACTC 
TTCCAAAACAACAATCAGAATCACCATTCCACAACCTCACAACCATAGAA 
ATGAGATGGTGTCATGGCTTTAGGTACTTGTTTTCGCCTCTCATGGCAGA 
ACTTCTTTCCAACCTAAAGAAAGTCAAGATACTTGGGTGTGATGGTATTG 

20 AAGAAGTTGTTTCAAACAGAGATGATGAGGATGAAGAAATGACTACATTT 
ACATCTACCCACACAACCACCAACTTGTTCCCTCATCTTGATTCTCTCAC 
TCTAAAATACATGCACTGTCTGAAGTGTATTGGTGGAGGTGGTGCCAAGG 
ATGAGGGGAGCAATGAAATATCTTTCAATAATACCACTACAACTACCGAT 
CAATTTAAGGTATGTTTGTACATATTTAATTATATATTTAATTTCCTTGT 

25 TAATTTCCTTTTCTTTGCAATATTCTATGCGAACTCAAGAATGGGATTTG 
GAGGCATATAAAGTTACATTCATTTGAACAAGTATTACCTTTTATTTGTT 
ATTTATCATTTTCATATCAAGTACCTATAACATTTCTTTTTTATTTTTCT 
AATTAGAAGAGGTCCACATGTCTAATTAGGTTTTCCATTCTATGTGTAAC 
CTCTATTCTCTCTGTAATCAAGCATCTTAGATTATTTATCCATTTTCATA 

30 ATTGTGTTTATTTTTACAGTTTTTTTTTTTATT^ 

TTTTAATTTATTTATTATTTTTTTTTTGGTAATTGCAACCTGTCATATAT 
TCAAGTCTTAATGTAACATAATAATACATTTTATACCCACTATACTAAGA 
TAATAATTACCTAAAGGGATGGATGCCATGACACTGCTACACTTCAGNAA 
CTCTAGTAAGGGCAGTTATGGAAGTTCAATAAAATGATAATGGCATCTTT 

35 TGATGGGTAATATAGGCAATTTAAGTTTTATTTCTGTTAAAGCAGTATTT 
AGCTAGTAGTGGCCAGTAGGAGAGGAGAATATCACCTTTTGTCAAAATCT 
GGTCATTGTACCCAGAATTTAGTTAAATGTAACATTTTAGATATTAGGGG 
TCATCAGGTGACAGATATTGTAGAATAGAACAATATGTAATATTACCCAA 
AACTATTTTTTCTAAGGTTGCTCTGTTAAATATGTGCTTTCm'GATTTCA 

40 TTGAATTTGCATTCGTATATTTTAGGTGGTAAACTGATTGTCTCTTCAAT 
AAATCCTGAAATTAATTAAAAAAAAAAAAACAAAAGTACATTTTTGATTT 
GGAGAGCACTGGTATCATTTAGTATAGAAAAAAACTAGATTTTGAATTAY 
CTTTCTTATATAAAAGTTGTGTATATAGTTTAATTAGTTTTACATCATTT 
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TTCTATGTGTTGTTGCAGTTGTCTGAAGCAGGTGGTGTTTGTTGGAGCTT 
ATGCCAATACTCTAGAGAGATAGAGATATATAGGTGTGATGCACTGTCAA 
GTGTAATTCCATGTTACGCAGCAGGACAAATGCAAAAGCTGCAAGTGCTG 
ACAGTCAGTTCTTGTAATGGTCTGAAGGAGGTATTTGAAACTCAATTAGG 

5 GACGAGCAGCAACAAAAACAACGAGAAGAGTGGTTGTGAGGAAGGAATTC 
CAAGAGTAAATAACAATGTTATTATGCTTCCCAATCTAAAGATATTGGAA 
ATCTACGGTTGTGGGGGTTTGGAACATATATTCACATTCTCTGCACTTGA 
AAGCCTGAGACAGCTCCAAGAGTTAACGATTAAGGGTTACTACTCTTGTC 
AATCTTCCAAACCTCAAAGAAATGAGGTTGGAGTGGCTAAGTAATCTGAG 

10 GTATATATGGAAGAGCAATCAGTGGACAGCATTTGAGTTTCCAAACCTAA 
CAAGAGTTGAAATTTGTGAATGTAATTCATTAGAACATGTATTTACTAGT 
TCCATGGTTGGTAGTCTATTGCAACTCCAAGAGCTACATATATTTAACTG 
CAGTCTGATGGAGGAGGTAATTGTTAAGGATGCAGATGTTTCTGTAGAAG 
AAGACAAAGAGAAAGAATCTGATGGCAAGACGAATAAGGAGATACTTGTG 

15 TTACCTCATCTAAAGTCCTTGAAATTACAACTTCTTCGAAGTCTTAAGGG 
GTTrAGCTTGGGGAAGGAGGATTTTTCATTCCCATTATTGGATACTTTAG 
AAATCAAAAGATGCCCAACAATAACCACCTTCACCAAAGGAAATTCCGCT 
ACTCCACAACTAAAAGAAATACAAACAAATTTTGGCTTCTTTTATGCTGC 
AGGGGAAAAAGACATCAACTCTCTTATAAAGATCAAACAACAGGTAAATC 

20 AGATCTTTGTTGCTTTAATAATTCTTAAACTACATTTGAAAAGCTTCATG 
CAAGTTTTTTTGTTATATTGTCAAAAACCGCAACCTACATTCAGCTTTAT 
ATTTATGTACTTTATGCAGGATTTCAAACAAGACTCAGATTAATGTGAAG 
TGAATATTAAAGGTAAATTATATTTTCATGTTCCTAGTTGCCTATTAATT 
AATGGCCTTTTAGTTCATGATTTTTGGATGTATTCTTCATGATGATGTGA 

25 ATCTTCTAATACCCCATTCATTGTTTGGTTGAATGTTGACTCTATGTCAG 

GATGAATATTCAAGGGAAGAATTGTTCATCAWATGAAGGACATTAAAGAA 
CATGGATGCTATGAAGATGTTGGGAAAACATATGTATCAAGTGGCAARCT 
GCTTAATGATCTAAGTTTGTTGGTTGANGATGTTGATTTTAATATTTCAA 
ATTCATTGGTTATATGGGCTTATCAATAGTGTTAATGGGATAATGAGTGA 

30 CTT.AACCTAAATTATGTTGTTGGTAAATGTTGGACAAGTATGGAAAATTA 
GGAATGACTTGTGAAAAAAAAATAAAAAAAAA (SEQ ID NO:94) 

RG2D deduced polypeptide sequence (SEQ ID NO:95) 

MA^IETANE^KQVVPVU4WINDYLRYVVSCRKYISDMDLKMKELKEAKDNVEE 
35 HKNHNISNRLEVPAAQVQSWLEDVEKINAKVETVPKDVGCCFNLKIRYRAGRDAF 
NHEEIDSVMRRHSLITWTDHPIPLGRVDSVMASTSTLSTEHNDFQSREVKFSEALKA 
LEA>raMIALCGMGRVGKTHMMQRLKKVAKEKRKFGYIIEAVIGEISDPIAIQQVVA 
DYLCffiUCESDKKTRAEKUlQGFKAKSDGGmTa'UIIJJDVWQSVDLEDIGLSPSPN 
QGM^FKVLLTSRDEHVCSVMGVEANSIINVGLLIEAEAQRLFQQFVETSEPELHKIG 
40 EDI\TUICCGLPIAIKTMACTLRNKRKDAWKDALSRLQHHD1GNVATAVFRTSYENL 
PDKETKSVFUVICGLFPEDFNIPTEElMRYGWGLKLFDRVYTnEARNRLNTCIERLV 
QANXLIGSDNGVHVKMHDLVRAFVLGMYSEVEQASIVNHGNMPGWPDENDMIVH 
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SCKMSLTCKGMIEIPVDLKITKLTILKIJ^GDKSLKFPQEFYEGMEKLQVISYDKM . 

KYPLLPLAPQCSTNIRVLHLTECSLKMFDCSSIGNLSNLEVLSFANSRIEWLPSTVRN 

IJCKIJa.LDLRFCDGlJUEQGVlJCSLVKl£EFYIGNAYGFroDNCKDMAERSYNLSA 

DEFAFFNNKAEVKNMSFENLEM^SVGCSFDGMSMSSHSYENMLQLVTNKG 

DSKLNGLFIJKTEVLFIJSVHGMNDLEDVEVKSTHPTQSSSFCNLKVRnSKCVELRYL 

FKLHVANTLSSLEHLEVCGCENMEEUHTGIGGCGEETITFPKLKSLSLSQLPKLSGL 

CHhr^n^GLPHLVDUQiCGIPGFIWPQNKUlTSSLUCEEVVIPKI^TLQroGME^ 

EEIWPCEI^GGEKVKIJREIKVSSCDKLVNIJFPHNPMSIXHHI^EIXVK^ 

NroLIX:VSAIGEEDNKSIlJUaK\^IX3KUlEVWRIKGADNSRPlJHGFPA\^ 

GCKRFRNIFTPITANFDLVALLEIHIGNYRENHESEEQIEILSEKETLQEVTDTNISND 

VVUTSCUwIHSFHNLHKlJCI^NYEGVEVVFEIESESPTCRELVTTHNNQQQPniJ^ 

LQELYIJlNMDNTSHVWKCSNWNKFFriJ»KQQSESPFHNLTTIEMRWCHGFRYLFS 

PmAElIJSNLKKViaiX3CDGffiEWSNIU)DEDEEMTTFrSTHTT^ 

YMHCLKCIGGGGAKDEGSNEISFNNTTTTTDQFKLSEAGGVCWSLCQYSREIEIYRC 

DALSSVIPCYAAGQMQKLQVLTVSSCNGLKEVFETQLGTSSNKNNEKSGCEEGIPR 

VNNNVMIJNLKILmYGCGGI^HIFTFSALESLRQLQELTIKGYYTLVNLPNLKEM 

RLEWIJSNLRYIWKSNQWTAFEFPNLTRVEICECNSLEHVFTSSMVGSLLQLQELHIF 

NCSmEEVIVKDADVSVEEDKEKESIX3KTNKEILVLPHliCSIJa.QLLRSLKGFSLGK 

EDFSFPlXDTLEIKRCPTnTFTKGNSATPQLKEIQTNFGFFYAAGEKDINSLIKIKQQ 

DFKQDSD.CEVNK 

RG2E polynucleotide sequence (SEQ ID NO:96) 

TGGGAAGACACAATGATGCAAAGGTTGAAGAAGGTTGCTAAAGAAAATAGAAT 
GTTCAATTATATGGTTGAGGCAGTTATAGGGGAAAAGACAGACCCACTTGCTAT 
TCAACAAGCTGTAGCGGATTACCTTTGTATAGAG1TAAAAGAAAGCACTAAACC 
AGC.JlAGAGCTGATAAGCTTCGTGAATGGTTTAAGGCCAACTCTGGAGAAGGTA 

aga-ataagttccttgtaatatttgatgatgtttggcagtccgttgatctggaag 

acattggtttaagtcattttccaaatcaaggtgtcgacttcaaggtcttgttga 

cttcacgagacgaacatgtttgcacagtaatgggggttgaagctaattcaattc 

tta.atgtgggacttctagtagaagcagaagcacaaagtttgttccagcaatttg 

tagaaacttttgagcccgagctccataagataggagaagatatcgtaaggaag 

tgttgtggtttacctattgccattaaaaccatggcatgtactctaagaaataaa 

agaaaggatgcatggaaggatgcacttttgcatttagagtaccatgacattagc 

agtgttgcgcccaaagtctttgaaacgagctaccataatctccacaacaaggag 

actaaatctgtgtttttgatgtgtggtttttttcctgaagacttcaatattccaa 

tcgaggagttgatgaggtatggatggggcttaaagatatttgatagagtttata 

ctattagacaagcaagaatcaggctcaacacctgcattoagcgactggtgcag 

acaaatttgttaatagaaagtgatgatggtgtgcacgtcaagatgcatgatctg 

gtccgtgctttcgttttggttatgttttctgaagttgaacatgcttcaattatca 

accatggtaatatgcttggatggcctgaaaattatatgaccaactcttgcaaaa 

caatttcattaacatgcaagagtatgtctgaatttccgggagatctcaagtttc 

caaacctaacgattttgaaactcatgcatggagataagttgctaagatatcctc 
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AAGACTTTTATGAAGGAATGGAAAAGCTCTGGGTTATATCATATGATGAAATGA 
AGTATCCATTGCTTCCCTCGTTACCTCAATGCTCCATCAACCTTCGAGTGCTTCA 
CCTCCATCGATGCTCATTAATGATGTTTGATTGCTCTTGTATTGGAAATATGTTG 
AATCTGGAAGTGCTTAGCTITGTTAAATCTGGCATTGAATGGTTACCTTCCACA 
5 ATAGGAAATTTAAAGAAGCTAAGGTTACTTGATCTGAGAGATTGTTATGGTCTT 
CGTATAGAAAAAGGTGTOTGAAAAATTTGGTGAAAATTGGAGGAATTTATATT 
GGTAGAGCAGATATTTTATAGAT 



RG2E deduced polypeptide sequence (SEQ ID NO:97) 

10 WEDTMMQRUCKVAKENRMFNYMVEAVIGEKTDPLAIQQAVADYLCIELKESTKP 
ARADKLREWFKANSGEGKNKFLVIFDDVWQSVDLEDIGLSHFPNQGVDFICVLLTS 
RDEHVCrVMGVEANSILNVGliVEAEAQSIJQQFVETFEPELHKIGEDIVRKCCGL 
PIAIKmACTIJmKWaJAWKDAIIJILEYHDISSVAPKVFETSYHNIJHNKE^ 
MCGFFPEDFNIPIEELMRYGWGLKIFDRVYTIRQARIRLNTCIERLVQTNLLIESDDG 

15 VH\TCMHDLVRAFVLVWSEVEHASIINHGNMLGWPENYMTNSCKTISLTCKSMSE 
FPGDUCITNLTIIJaMHGDKUJlYPQDFYEGMEKLWVISYDEMKYPIJPSI^^ 
NLRVLHLHRCSLMMFDCSCIGNMLNI^Vl^FVKSGIEWIJSTIGNIJaCIJUiDl^ 
CYGLRIEKGVLKNLVKIGGIYIGRADIL. 

20 RG2F polynucleotide sequence (SEQ ID NO:98) 

CTGTGGAAGACACAATGATGCAAAGGCTGAAAAAGGTTGTGCATGAAAAGAAA 
ATGTTTAACTTTATTGTTGAAGCAGTTATAGGGGAAAAGACAGACCCCGTTGCC 
ATTCAGGATGCTATAGCAGATTACCTAGGTGTAGAGCTCAATGAAAAATCTAAG 
CAAGCAAGAGCTGATAAGCTCCGTCAAGGATTCAAGGACAAATCAGATGGAGG 

25 CAA.\AATAAGTTCTTTGTAATACTTGACGATGTTTGGCAGTCTGTTGATCTGGA 
AGATATTGGTTTAAGTCCTTTTCCAAATCAAGGCGTCGACTTCAAGGTCTTGTT 
GACATCACGAGACAGACATGTTTGCACAGTGATGGGGGTTGAAGCCAAATTAA 
TTCTAAACGTGGGACTTCTAATTGAAGCTGAAGCACAAAGTTTGTTCCACCAAT 
TTGTTGTCACTTCTGAGCCCGAGCTCCATAAGATAGGAGAAGATATTGTAAAGA 

30 AGTGTTTCGGTCTGCCAATTGCCATCAAAACCATGGCATGTACTCTACGACATA 
AAAGAAAGGATGCATGGAAGGATGCACTTTCACGTTTAGAGCACCATGACATT 
CAAAGTGTTGTGCCTAAAGTATTTGAAACGAGCTACAACAATCTCAAAGACAA 
GGAGACTAAATCCGTATTTTTGATGTGTGGTTTGTTTCCTGAAGACTTGGATAT 
ACCTATCGAGGAGTTGATGAGGTATGGATGGGGCTTAAGATTATTTGATAGAGT 

35 TAATACTATTACACAAGCAAGAAACAGGCTCAACACCTGCATTGAGCGACTGG 
TGCACACAAATTTGTTAATTGAAAGTGTTGATGGTGTGCATGTCAAGATGCATG 
ATCTGGTTCGTGCTTTTGTTTTGGGAATGTTTTCTGAAGTGGAGCATGCTTCAAT 
TGTCAACCATGGTAATATGCCCGAGTGGACTGAAAATGATATGACTGACTCTTG 
CAAACAAATTTCATTAACATGCAAGAGTATGTTGGAGTTTCCTGGAGACCTCAA 

40 GTTTCCAAACCTAAAGATTTTGAAACTTATGCATGGAGGTAAGTCACTAAGGTA 
TCCTCAAGACTTTTATCAAGGAATGGAAAAGCTGGAGGTTATATCATACGATGA 
AATGAAGTATCCATTGCTTCCCTCGTTGCCTCAATGTTCCACCATCCTTCGAGTG 
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CTTCATCTCCATGAATGTTCATTAAGGATGTTTGATTGCTCTTCAATCGGTAATC. 
TTTTCAACATGGAAGTGCTCAGCTTTGCTAATTCTAGCATTGAATTGTTACCTTC 
CGTAATTGGAAATTTGAAGAAGTTGCGGCTGCTAGATTTGACAAACTGTTATGG 
TGTTCGTATAGAAAAGGATGTCTTGAAAAATTTGGTGAAACTTGAAGAGCTTTA 
5 TATTAGGAATGGTCTACCAGTTTACAGAGGAT 



RG2F deduced polypeptide sequence (SEQ ID NO:99) 

VEDTMMQRLKKVVHEKKMFNFIVEAVIGEKTDPVAIQDAIADYLGVELNEKSKQA 
RADKUIQGFKDKSDGGKNKFFVIUJDWQSVDIJBDIGI^PFPNQGVDFKVIXTSRD 

10 RHVCIWGVEAKmj^GUJEAEAQSU^QFVVTSEPEUIKIGEDI^ 
KmACTXJfflKMOJAWKDALSRLEHHDIQSVWKVFETSYNNIimK^ 
LFPEDLDIPffiEIJvmYGWGLRLFDRVNmQAimRmTCIERLVHTNLLIESVDGVH 
VKMHDLVRAFVLGMFSEVEHASIVNHGNMPEWTENDMTDSCKQISLTCKSMLEFP 
GDUOTNUaUOMHGGKSIilYPQDFYQGMEKLEVISYDEMKYPLLPSLFQCSTILR 

15 VIJILHECSIJlMHDCSSIGNIJ^EVI^FANSSffiIJJ»SVIGNIJaaJU.IJ)LTO^ 
RIEKDVLKNLVKLEELYIRNGLPVYRG 

R62G polynucleotide sequence (SEQ ID NO:100) 

GAAGACACGATGATGAAGAACTGAAGGAGGTCGTGGGACAAAAGAAATCATTC 

20 AATATTATTATTCAAGTGGTCATAGGAGAGAAGACAAACCCTATTGCAATTCAG 
CAAGCTGTAGCAGATTACCTCTCTATAGAGCTGAAAGAAAACACTAAAGAAGC 
AAGAGCTGATAAGCTTCGTAAACGGTTTGAAGCCGATGGAGGAAAGAATAAGT 
TCCTTGTAATACTTGACGATGTATGGCAGTTTGTCGATCTTGAAGATATTGGTTT 
AAGTCCTCTGCCAAATAAAGGTGTCAACTTCAAGGTCTTGTTGACGTCAAGAGA 

25 TTCACATGTTTGCACTCTGATGGGAGCTGAAGCAAATTCAATTCTTAATATAAA 
AGTTTTAAAAGATGTAGAAGGACAAAGTTTGTTCCGCCAGTTTGCTAAAAATGC 
GGGTGATGATGACCTGGATCCTGCTTTCAATGGGATAGCAGATAGTATTGCAAG 
TAGATGTCAAGGTTTGCCCATTGCCATCAAAACCATTGCCTTAAGTCTTAAAGG 
TAGAAGCAAGTCTGCATGGGACGTTGCACTTTCTCGTCTGGAGAATCATAAGAT 

30 TGGTAGTGAAGAAGTTGTGCGTGAAGTTTTTAAAATTAGCTACGACAATCTCCA 
AGATGAGGTTACTAAATCTA'l'rrrrrrACrTTGTGCTTTATTTCCTGAAGATTT^ 
GATATTCCTACTGAGGAGTTGGTGAGGTATGGGTGGGGCTTGAAATTATTTATA 
GAAGCAAAAACTATAAGAGAAGCAAGAAACAGGCTCAACACCTGCACTGAGCG 
GCTTAGGGAGACAAATTTGTTATTTGGAAGTGATGACATTGGATGTGTCAAGAT 

35 GCACGATGTGGTGCGTGATTTTGTTTTGCATATATTCTCAGAAGTCCAACACGC 
TTCAATTGTCAACCATGGTAACGTGTCAGAGTGGCTAGAGGAAAATCATAGCAT 
CTACTCTTGTAAAAGAATTTCATTAACATGCAAGGGTATGTCTCAGTTTCCCAA 
AGACCTCAAATTTCCAAACCTTTCAATTTTGAAACTTATGCATGGAGATAAGTC 
ACTGAGCTTTCCTGAAAACTTTTATGGAAAGATGGAAAAGGTTCAGGTAATATC 

40 ATATGATAAATTGATGTATCCATTGCTTCCCTCATCACTTGAATGCTCCACCAA 
CGTTCGAGTGCTTCATCTTCATTACTGTTCATTAAGGATGTTTGATTGCTCTTCA 
ATTGGTAATCTTCTCAACATGGAAGTGCTCAGCTTTGCTAATTCTAACATTGAA 
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TGGTTACCATCTACAATTGGAAATTTGAAGAAGCTAAGGCTACTAGATTTGACA. 
AATTGTAAAGGTCTTCGTATAGATAATGGTGTCTTAAAAAATTTGGTCAAACTT 
GAAGAGCTTTATATGGGTGTTAATCGTCCGTATGGACAGGCCGTTAGCTTGACA 
GATGAAAA 

5 

RG2G deduced polypeptide sequence (SEQ ID NO:101) 

RHDDEEUOEWGQKKSFNraQWIGEKTNPIAIQQAVADYI^IELKENTKEARADKL 

RKRFEADGGKNKFLVILDDVWQFVDLEDIGLSPLPNKGVNFKVLLTSRDSHVCTL 

MGAEANSILIOTCVLKDVEGQSLFRQFAKNAGDDDLDPAFNGIADSIASRCQGLPIAI 

10 KTIAl^UCGRSKSAWDVAI^RLENHKIGSEEVVREVFKISYDNUJDEVTKSIFLir 
FPEDFDIPTEELWYGWGLKLHEAKTIRBARNRIJ^Cr^ERlJlET^a.IJ'^^^ 
MHDVVRDFVUHFSEVQHASIVNHGNVSiEWIJEENHSn^SCKMSLTCKGMSQFPK^ 
KFPNLSILKLMHGDKSLSFPENFYGKMEKVQVISYDKLMYPLLPSSLECSTNVRVLH 
UIYCSUmPDCSSIGNLLNMEVl^FANSNffiWLPSTIGNLKKLRLLDLTNCKGLRID 

15 NGMJCNLVKLEELYMGVNRPYGQAVSLTDE 

RG2H polynucleotide sequence (SEQ ID NO:102) 

TGA.\GGAGGTTGTGGAACGAAAGAAAATGTTCAGTATTATTGTTCAAGTG 
GTCATAGGAGAGAAGACAAACCCTATTGCTATTCAGCAAGCTGTAGCAGA 

20 TTACCTCTCTATAGAGCTGAAAGAAAACACTAAAGAAGCAAGAGCTGATA 
AGCTTCGTAAATGGTTCGAGGCCGATGGAGGAAAGAATAAGTTCCTTGTA 
ATACTTGACGATGTATGGCAGTTTGTCGATCTTGAAGATATTGGTTTAAG 
TCCTCTGCCAAATAAAGGTGTCAACTTCAAGGTCTTGTTGACGTCAAGAG 
ATTCACATGTTTGCACTCTGATGGGAGCCGAAGCCAATTCAATTCTCAAT 

25 ATA.\AAGTTTTAACAGCTGTAGAAGGACAAAGTTTGTTCCGCCAGTTTGC 
TAAAAATGCGGGTGATGATGACCTGGATCCTGCTTTCAATAGGATAGCAG 
ATAGTATTGCAAGTAGATGTCAAGGTTTGCCCATTGCCATCAAAACCATT 
GCCTTAAGTCTTAAAGGTAGAAGCAAGCCTGCGTGGGACCATGCGCTTTC 
TCGTTTGGAGAACCATAAGATTGGTAGTGAAGAAGTTGTGCGTGAAGTTT 

30 TTA-^AATTAGCTATGACAATCTCCAAGATGAGATTACTAAATCTATTTTT 
TTACTTTGTGCirrATTTCCTGAAGATTTTGATATTCCTACTGAGGAGTT 
GATGAGGTATGGATGGGGCTTGAAATTATTTATAGAAGCAAAAACTATAA 
GAGAAGCAAGAAACAGGCTCAACACCTGCACTGAGCGGCTTAGGGAGACA 
AATTTGTTATTTGGAAGCGATGACATTGGATGCGTCAAGATGCACGATGT 

35 GGTGCGTGATTTTGTTTTGCATATATTCTCAGAAGTCCAGCACGCTTCAA 
TTGTCAACCATGGTAACGTGTCAGAGTGGCTAGAGGAAAATCATAGCATC 
TACTCTTGTAAAAGAATTTCATTAACATGCAAGGGTATGTCTGAGTTTCC 
CAAAGACCTCAAATTTCCAAACCTTTCAATTTTGAAACTTATGCATGGAG 
ATAAGTCGCTGAGCTTTCCTGAAAACTTTTATGGAAAGATGGAAAAGGTT 

40 CAGGTAATATCATATGATAAATTGATGTATCCATTGCTTCCCTCATCACT 
TGAATGCTCCACTAACGTTCGAGTGCTTCATCTCCATTATTGTTCATTAA 
GGATGTTTGATTGCTCTTCAATTGGTAATCTTCTCAACATGGAAGTGCTC 



/ 
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AGCTTTGCTAATTCTAACATTGAATGGTTACCATCTACAATTGGAAATTT 
GAAGAAGCTAAGGCTACTAGATTTGACAAATTGTAAAGGTCTTCGTATAG 
ATAATGGTGTCTTAAAAAATTTGGTCAAACTTGAAGAGCTTTATATGGGT 
GTTAATCATCCGTATGGAC 

5 

RG2H deduced polypeptide sequence (SEQ ID NO:103) 

KEVVERKKMFSnVQWIGEKTNPIAIQQAVADYl^ffilJCENTKEARADKLRKWFEA 
DGGKNKFLVILDDVWQFVDLEDIGLSPLPNKGVNFKVLLTSRDSHVCTLMGAEAN 
SI1JSTKVLTAVEGQSLFRQFAKNAGDDDU)PAFNRIADSIASRCQGU>IAIKTIALSLK 

10 GRSKPAWDHAl^RLENHKIGSEEVVREVFKISYDNU^DEITKSIFIirALW 

TEEO^YGWGIia.FIEAKTIREARNRLNTCTERIJlETNlJJGSDDIGCVmHDV^ 
DFVLHIFSEVQHASIV^raG^rVSEWLEENHSIYSCKRISLTCKGMSEFPKDLKFPNLSI 
LKLMHGDKSLSFPENFYGKMEKVQVISYDKLMYPLLPSSLECSTNVRVLHLHYCSL 
RMFDCSSIGNLLNMEVLSFANSNIEWLPSTIGNLKKLRLLDLTNCKGLRIDNGVLKN 

15 LVKLEELYMGVNHPYG 



RG2I polynucleotide sequence (SEQ ID NO:104) 

AAG.\AGAGCTGAAGGAGGTTGTGGAACAAAAGAAAACGTTCAATATTATT 
GTTCAAGTGGTCATAGGAGAGAAGACAAACCCTATTGCTATTCAGCAAGC 

20 TGTAGCAGATTCCCTCTCTATAGAGCTGAAAGAAAACACTAAAGAAGCAA 
GAGCTGATAAGCTTCGTAAATGGTTCGAGGCTGATGGAGGAAAGAATAAG 
TTCCTCGTNATACTTGACGATGTATGGCNGTTTGTTGATCTTGAAGATAT 
TGGTTTAAGTCCTCATCCAAATAAAGGTGTCANCTTCAAGGTCTTGTTGA 
CGTCAAGAGATTCACATGTTTGCACTCTGATGGGAGCTGAAGCCAATTCA 

25 ATTCTCAATATAAAAGTTTTAAAAGATGTAGAAGGAAAAAGTTTGTTCCG 
CCAGTTTGCTAAAAATGCGGGTGATGATGACCTGGATCCTGCTTTCATTG 
GGATAGCAGATAGTATTGCAAGTAGATGTCAAGGTTTGCCCATTGCCATC 
AAAACCATTGCCTTAAGTCTTAAAGGTAGAAGCAAGTCTGCATGGGACGT 
TGCACTTTCTCGTCTGGAGAATCATAAGATTGGTAGTGAAGAAGTTGTGC 

30 GTG.AAGTTTTTAAAATTAGCTATGACAATCTCCAAGATGAGGTTACTAAA 
TCTA'l"lU"ll"llACTTTGTGCrTTATTTCCTGAAGATTTTGATATTCCTAC 
TGAGGAGTTGGTGAGGTATGGGTGGGGCTTGAAATTATTTATAGAAGCAA 
AAACTATAAGAGAAGCAAGAAACAGGCTCAACACCTGCACTGAGCGGCTT 
AGGGAGACAAATTTGTTATTTGGAAGTGATGACATTGGATGCGTCAAGAT 

35 GCACGATGTGGTGCGTGATTTTGTTTTGCATATATTCTCAGAAGTCCAGC 
ACGCTTCAATTGTCAACCATGGTAATGTGTCAGAGTGGCTAGAGGAAAAT 
CATAGCATCTACTCTTGTAAAAGAATTTCATTAACATGCAAGGGTATGTC 
TGAGTTTCCCAAAGACCTCAAATTTCCAAACCTTTCAATTTTGAAACTTA 
TGCATGGAGATAAGTCGCTGAGCnTTCCTGAAAACTTTTATGGAAAGATG 

40 GAA-AAGGTTCAGGTAATATCATATGATAAATTGATGTATCCATTGCTTCC 
CTCATCACTTGAATGCTCCACCAACCTTCGAGTGCTTCATCTCCATGAAT 
GTTCATTAAGGATGTTTGATTGCTCTTCAATTGGTAATCTTCTCAACATG 
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GAAGTGCTCAGCTTTGCTAATTCTGGCATTGAATGGTTACCATCTACAAT 
TGGAAATTTGAAGAAGCTAAGGCTACTGGATCTGACAGATTGTGGAGGTC 
TTCATATAGATAATGGCGTCTTAAAAAATTTGGTCAAACTTGAAGAGCTT 
TATATGGGTGCTAATCGTCTGTTTGGAAAGTGCCAT 

5 

RG2I deduced polypeptide sequence (SEQ ID NO:105) 

EEU□EWEQKKTF^^IVQVVIGECTNPIAIQQAVADSIJSIELKENTKEARADKIJ^^ 
EADGGKNKFLmDDW?FVDLEDIGLSPHPNKGV?FKVLLTSRDSHVCTLMGAEA 
NSIL>raCVLKDVEGKSLFRQFAKNAGDDDLDPAnGIADSIASRCQGU>IAIKTIALSL 

10 KGRSKSAWDVALSRLENHKIGSEEVVREVFKISYDNLQDEVTKSIFLLCALFPEDFDI 
PTEELVRYGWGUOJTEAKTmEARNRli^TCrimjffiTNIXFGSDDIGC^^ 
RDFNOJDFSEVQHASIVraGNVSEWLEENHSIYSCKRISLTCKGMSEFPKDLKFPNLS 
ILKLMHGDKSLSFPENFYGKMEKVQVISYDKLMYPLLPSSLECSTNLRVLHLHECSL 
RMFDCSSIGNLLNMEVLSFANSGIEWLPSTIGNLKKLRLLDLTDCGGLfflDNGVLKN 

15 LVKLEELYMGANRLFGKCH 



RG2J polynucleotide sequence (SEQ ID NO:106) and (SEQ ID NO:107) . 

ATGTCCGACCCAACAGGGATTGTTGGTGCCATTATTAACCCAATTGCTCA 
AACGGCCTTGGTTCCCCTTACAGACCATGTAGGCTACATGATTTCCTGCA 

20 GAA.AATATGTGAGGGACATGCAAATGAAAATGACAGAGTTAAATACCTCA 
AGAATCAGTGCAGAGGAACACATTAGCCGGAACACAAGAAATCATCTTCA 
GATTCCATCTCAAATTAAGGATTGGTTGGACCAAGTAGAAGGGATCAGAG 
CGA.ATGTTGCAAACTTTCCAATTGATGTCATCAGTTGTTGTAGTCTCAGG 
ATCAGGCACAAGCTTGGACAGAAAGCCTTCAAGATAACTGAGCAGATCGA 

25 AAGTCTAACGAGACAAAATTCGCTGATTATCTGGACTGATGAACCTGTTC 
CCCTGGGAAGAGTTGGTTCCATGATTGCATCCACCTCTGCAGCATCAAGT 
GATCATCATGATGTCTTCCCTTCAAGAGAGCAAATTTTTAGGAAAGCACT 
AGA.AGCACTTGAACCCGTCCAAAAATCCCACATAATAGCCTTATGGGGGA 
TGGGCGGAGTGGGGAAGACCACGATGATGAAGAAGCTGAAAGAGGTCGTG 

30 GAACAAAAGAAAACGTGCAATATTATTGTTCAAGTGGTCATAGGAGAGAA 
GAC.AAACCCTATTGCTATCCAGCAAGCTGTAGCAGATTACCTCTCTATAG 
AGCTGAAAGAAAACACTAAAGAAGCAAGAGCTGATAAGCTTCGTAAACGG 
TTCGAAGCCGATGGAGGAAAGAATAAGTTCCTTGTAATACTTGACGATGT 
ATGGCAGTTTTTCGATCTTGAAGATATTGGTTTAAGTCCTCTGCCAAATA 

35 AAGGTGTCAACTTCAAGGTCTTGTTGACGTCAAGAGATTCACATGTTTGC 
ACTCTGATGGGAGCTGAAGCCAATTCTATTCTCAATATAAAAGTTTTAAA 
AGATGTAGAAGGAAAAAGTTTGTTCCGCCAGTTTGCTAAAAATGCGGGTG 
ATGATGACCTGGATCCTGCTTTCATTGGGATAGCAGATAGTATTGCAAGT 
AGATGTCAAGGTTTGCCCATTGCCATCAAAACCATTGCCTTAAGTCTTAA 

40 AGGTAGAAGCAAGTCTGCATGGGACGTCGCACTTTCTCGTCTGGAGAATC 
ATAAGATTGGTAGTGAAGAAGTTGTGCGTGAAGTTTTTAAAATTAGCTAT 
GACAATCTCCAAGATGAGGTTACTAAATCTATTTTTTTACTCTGTGCTTT 
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ATTTCCTGAAGATTTTGATATTCCTATTGAGGAGTTGGTGAGGTATGGGT 
GGGGCTTGAAATTATTTATAGAAGCAAAAACTATAAGAGAAGCAAGAAAC 
AGGCTCAACAACTGCACTGAGCXKKnTAGGGAGACAAATTTGTrATTTGG 
AAGTCATGACTTTGGGTGCGTCAAGATGCACGATGTGGTGCGTGATTTTG 
5 TTTTGCATATGTTTTCAGAAGTCAAGCATGCTTCAATTGTCAACCATGGT 
AACATGTCAGAGTGGCCAGAGAAAAATGATACCAGCAACTCTTGTAAAAG 
AATTTCATTAACATGCAAGGGTATGTCTAAGTTTCCTAAAGACATCAACT 
ATCCAAACCTTTTGATTTTGAAACTTATGCATGGAGATAAGTCGCTGTGC 
TTTCCTGAAAACTTTTATGGAAAGATGGAAAAGGTTCAGGTAATATCATA 

10 TGATAAATTGATGTATCCATTGCTTCCCTCATCACTTGAATGCTCCACTA 
ACGTTCGAGTGCTTCATCTCCATTATTGTTCATTAAGGATGTTTGATTGC 
TCirCAATTGGTAATCTTCTCAACATGGAAGTGCTCAGCTTTGCTAATTC 
TAACATTGAATGGTTACCATCTACAATTGGAAATTTGAAGAAGCTAAGGC 
TACTAGATTTGACAAATTGTAAAGGTCTTCGTATAGATAATGGTGTCTTA 

15 AAAAATTTGGTCAAACTTGAAGAGCTTTATATGGGTGTTAATCGTCCGTA 
TGGACAGGCCGTTAGCTTGACAGATGAAAACTGCAATGAAATGGTAGAAG 
GTTCCAAAAAACTTCTTGCACTAGAATATGAGTTGTTTAAATACAATGCT 
CAAGTGAAGAATATATCCTTCGAGAATCTTAAACGATTCAAGATCTCAGT 
GGGATGTTCTTTACATGGATCTTTCAGTAAAAGCAGGCACTCATACGAAA 

20 ACACGTTGAAGTTGGCCATTGACAAAGGCGAACTATTGGAATCCCGAATG 
AACGGGTTGTTTGAGAAAACGGAGGTTCTTTGTTTAAGTGTGGGGGATAT 
GTATCATCTTTCAGATGTTAAGGTGAAGTCCTCrrCGTTCTACAATTTAA 
GAGTCCTTGTCGTTTCAGAGTGTGCAGAGTTGAAACACCTCTTCACACTT 
GGTGTTGCAAATACTTTGTCAAAGCTTGAGCATCTTAAAGTCTACAAATG 

25 CGATAATATGGAAGAACTCATACATACCGGGGGTAGTGAAGGAGATACAA 
TTACATTCCCCAAGCTGAAGCTTTTATATTTGCATGGGCTGCCAAACCTA 
TTGGGTTTGTGTCTTAATGTCAACGCAATTGAGCTACCAAAACTTGTGCA 
AATGAAGCTTTACAGCATTCCGGGTTTCACAAGCATTTATCCGCGGAACA 
AGTTGGAAGCATCTAGTTTGTTGAAAGAAGAGGTACATATACATATAGTT 

30 TATGTTAATACATTTTAAACAATCTTTTCAACTAAAAGTTTCAGAATATA 
TCTGTATTTTGATTGTATGATGTGTTAGTGTTTGGATGTGGCTATTAAAG 
GAT.AATTATTTGGCAGGTTGTGATTCCTAAGTTGGATATACTTGAAATTC 
ATGACATGGAGAATTTAAAGGAAATATGGCCTAGTGAGCTTAGTAGAGGT 
GAGAAAGTTAAGTTGAGAAAGATTAAAGTGAGAAATTGTGATAAACTTGT 

35 GAATCTATTTCCACACAATCCCATGTCTCTGCTGCATCATCTTGAAGAGC 
TTATAGTCGAGAAATGTGGTTCCATTGAAGAGTTGTTCAACATCGACTTG 
GATTGTGCCAGTGTAATTGGAGAAGAAGACAACAACAGCAGCTTAAGAAA 
CATCAATGTGGAGAATTCAATGAAGCTAAGAGAGGTGTGGAGGATAAAAG 
GTGCAGATAACTCrCGTCCCCTCTTTCGTGGCTTTCAAGTTGTTGAAAAG 

40 ATA.ATCATTACGAGATGTAAGAGGTTTACAAATGTATTCACACCTATCAC 
CACAAATTTTGATCTGGGGGCACTTTTGGAGATTTCAGTTGATTGTAGAG 
GAAATGATGAATCAGACCAAAGTAACCAAGAGCAAGAGCAGGTATGGATT 
TCA.ATTTTACT<nTITACTTAATTAATGATTAAGCCCCTGCTTTTTA^^ 
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AAAAGGGGACAAACCATTTCTTGACTTAATGTTGCAATACAAGTCATGTA 
TAAGAGTGATTAACTTTTTTTTATTTATAAAATAACTACAAAACATGTTT 
TTTCATTATAGATCATGTATAAATGTGACTAATTTTTTTCATCGCCTAAC 
TTTTGTTGATAAATCATTAGAAATGTCACTAATTACTTTTTAGTATTTAT 

5 AAA.ATAACTACAAAACATGTTTTTTCATTATAGATCATGTATATATCAAC 
TAAAAATATTATTCCCTTACACAAAAAAAAAAGGTTCAAGAAAGCCTGTA 
TTTCGAAATAACTAAAAAGAAAATATTTGATATTCACTAAGAGAAATTTT 
TTTCTAAACATGATCGCAAATGATTAAAACTTAAATTAAAACTAAAAAGA 
TTTTTATATATGTTATNCAAAATTAAAATTTGAAATTAAGTTTATAATTC 

10 TNGTNTCACAAAGGGATATATATAGTAAAATATTATTTTTTTGCAGTCAT 
GCATAGTTGTATTTTTAAATGATTTATTAACGTGGTAGGAGTGGAAACCA 
CTCAATCTAGTAGACGCACTATCACATCTCACATCAGCTTTACATCTATT 
TTTCnTrCTCCTTTTTTCATCTTTTTAAACTCATAACACNTA^^ 
ATATTTTCCAACACACTNAACTCATTGTCACATTATTATTTTTAATTTAA 

15 TTA-\ATTNGAAAATTAAAATTAANTAAANCNTAACATTTTTTAATTAAAA 
AATATTAATCCAAATAAAAANTNCACGATAAATTAAAAANGTTTANTTTG 
GAAAAAAANCC (SEQ ID NO:106) 
Sequence gap 

ATA-ACCCTTTCAAGGGTCAACTCAAGTCCAAGTTAAAGTCAAGGTCAAAA 

20 CCTTGGTTAAAGTCAACTTTGGTCAAAGTCAACATCTACTTGACTCACCT 
CACCGAGTTGGTCCACCAACTTGTCGAGTCCCTTAATCCACAAACTTCAA 
GAACTTCGATCCTACTCGTCGAGTCTTTCAAGAACTCTTCGAGTTTCCAT 
TACACAGAATCGGGACCTTTTGCTCATGACTCGCCGAGTTCATCCTTGAA 
CTTGTCGAGTCTAGCTTCATACGAGTTCGAGTGTTTAGTCCTTGACTCGT 

25 CGAGTTCTTCCTTGAACTCGTCGAGTCCATCTTCGTATAGTTGGGACATT 
GCCTTGAACTCACCGAGTTCATCATTGAACTCATCGAGTCCTTCGATCTT 
CAAGTCCATAATCCTGTCCATCTTGTTGAGTCCTCTTCTAGACTCAACCA 
GATTCCTCAGAAACAGAAAAGGTTAGGGAACCATTACCTGACTCGCCGAG 
TCCCAAGAACGAATCCCCGAGTCCCCCAATGTCCATGACCATACAATCGA 

30 TTTTCGTTGGGCTCATTGCATCCAAAGCATAGATCTAACCTCCTAGGGTC 
CATATTACACGTAAAGCTACGAACTTGACGTCCATGCATGGGGGATTTGG 
CTC.AAATGGCATTAAAATGGGGTTTATCTGATGCATGGGACTCCCATGGC 
CATAAAGTTAACACCTTTATGCCATGGGAATCCTCAATGGTTCCATATCT 
GAAGTTAACACTCTACAATATGTTCTAAACCCGAAGGTGGCTTAGAAATG 

35 CCCCAAAATGGCAAGATTCAAGCCTTAAAGGAGATCTAACAAATGATAAG 
TCA_AGGTTCAAGCTTTTTACCTTGAATAAGCTGGAAATGAAGCAAAATCT 
CTGGATCCACTTGCTTCTTCAAGAACCCCCAAGCTTCCACTTCTTCCTTC 
AAGTTTCAAACAACTTTAAACACTCAAAAATGGCTCAAGAACACTCAAAA 
AGCTTTAGGGTTTCGAGTTAGGGCTTTTTGGAAGCGAGAGGGACGATGGG 

40 GGCTGAAATGAGGCTAGAAAAAGTGTTTAAATAGGGGGCAAACCCTAAAT 
ATTAGGGTTTCATCCAGGCAGCCCTACTCGTCGAGTCGGGCTCCCGACTC 
GTCGAGTAGGTCACTTAAAACCCGCGTCCATAATCCAGTCTACTCGACGA 
GTTGGGCCTCCAACTCGTCGATTCCGAGTGCAAAACGTTCAATTACTTAA 
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ATTTAAATATGTACCAGGAACCGGGTGTTACAGTTGAGACTTTATACCTC 
CATAAGATAGATCTAGGTGCACATAGCCTGGATCCACAAGCTCCATGTCA 
ACAAGCGACTCTTCAAGAAGTTCATTCTTCCTCCTTAAGCACCAAAAAAC 
ACACAAAATCACCATGAAGCTCAAGAAATACTCAAATAGAGGATAGGGTT 
5 TCGTTCGTAGGGTTAGAGAGGATGGAGGCTAGAGGAAATGAGGGATAGAG 
GCGAGTTAAGGTCTTTAAATAGGGTCCAAGACCCTAAATTAGGGTTTTAA 
TCTGGCCAGACGAACGCAGGGTGTTCCCAAATGCATATGTGTCCAAATTC 
TCGTGTGCGCCATGCGTACCTCCCTTGTACGCCATGTGTACCGGGTTTGG 
TCCAAACCCTTCTAACTTCAAATGATCATAACTTGCACCCCTTATCTGTT 

10 TTCGATGTTCTTTATATCCACGGAAAGGTAACAAGAAGCCCTATACTTCT 
ATAAACTTTATTTAATCTGAAAACCAACCGAAATTAAATCCAAAATTCAT 
AAAAGTCCCGAACCAACACATTTACCGATACCCTTGGGCTCCAAAACACA 
AATTGAAAACCCGGATCATCCAAACTACATCATCCACCTCCAAATGAGCC 
CAAACTCAATTATTCAAGGGTTCTAAGCCTGTTAATGCCCACTCCTCGAT 

15 TACCACCCCGCAATGGGAAACGATTCAAAACAGGGCGTTACATAATTTGT 
TGTGGTTTTGTATTTTTTATTTCCGGTGAAGGTGAAAGATCCAACTATTT 
TTAATCrGTTGGCATTTTCCATCATTTGCAACTGTTTCnTGAAAAAAA/^ 
TACCTAAAATCAAAATAACCATTTTCAAATCCAAAATTATAAGAGAGAAT 
TGT.\AATGGACATGGAATCTTAAATCATTAACACAGTTCAGTACACAAGT 

20 TGCTAATTACATTTCTTGCTGTGCAGATTGAAATTCTATCAGAGAAAGAG 
ACATTACAAGAAGCCACTGACAGTATTTCTAATGTTGTATTCCCATCCTG 
TCTCATGCACTCTTTTCATAACCTCCAGAAACTTATATTGAACAGAGTTA 
AAGGAGTGGAGGTGGTGTTTGAGATAGAGAGTGAGAGTCCAACAAGTAGA 
GAATTGGTAACAACTCACCATAACCAACAACAGCCTGTTATATTTCCCAA 

25 CCTCCAGCATTTGGATCTAAGGGGTATGGACAACATGATTCGCGTGTGGA 
AGTGCAGCAACTGGAATAAATTCTTCACTCTTCCAAAACAACAATCAGAA 
TCCCCATTCCACAACCTCACAACCATAAATATTGATTTTTGCAGAAGCAT 
TAAGTACTTGTTTTCACCTCTCATGGCAGAACTTCTTTCCAACCTAAAGA 
AAGTCAATATAAAATGGTGTTATGGTATTGAAGAAGTTGTTTCAAACAGA 

30 GATGATGAGGATGAAGAAATGACTACATTTACATCTACCCACACAACCAC 
CATCTTGTTCCCTCATCTTGATTCTCTCACTCTAAGTTTCCTGGAGAATC 
TGAAGTGTATTGGTGGAGGTGGTGCCAAGGATGAGGGGAGCAATGAAATA 
TCTTTCAATAATACCACTGCAACTACTGCTGTTCTTGATCAATTTGAGGT 
ATGCTTTGTTCATATTCAATTATTTATTTAATTTCCTTTTTTAm 

35 TATTCTATAAATAATACATTTTATACCCACTATACTAAGATAATAATTAC 
CTAGAGGGATGGATGCTATGACACAGCTGCTACACTTCAGAAACTCTAGT 
AAGGGCAGTTATGGAAGTTCAATAAAATGATAATGGCATCTTTTGATGGG 
TAATATAGGCAATTTAAGTTTTATTTCTGTTAAAGCAGTATTTAGCAAGT 
ACTGGCCAGTAGGAGAGGAGAATATCACCTTTTGTGAAAATCTGGTCATT 

40 GTACCCAGAATTTAGTTAAATGTAACATTTTAGATATCAGGGGACATCAG 
GTGACAGATATTGTAGAATAGAACAATATATAATATTACCCAAAACTATT 
TTirCTAAGGTTTTTCTGTTAAATATGTGCTTTCTTGATTTCATTGAATT 
TGCATTCCTATATTTTAGGTGGTAAAGTGATTGTCTCTTCAATAAATCCC 
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gaa.\ttaattaaaaaaaaaaaaaaacaaaagtaaatttttgatatggaga 
gcactggtatcatttagtatataaaaaaactagattttgaattaagtttc 
ttatataaaagctgtgtatatagtttaattagttttacatcatttttcca 
tgtggtgttgcagttgtctgaagcaggtggtgtttcttggagcttatgcc 

5 aatacgctAgagagataagtatagaattctgcaatgcattgtcaagtgtg 
attccatgttatgcagcaggacaaatgcaaaagcttcaagtgctgacagt 
cagttcttgtaatggtctgaaggaggtatttgaaactcaattaaggagga 
gcagcaacaaaaacaacgagaagagtggttgtgatgaaggaaatggtgga 
attccaagagtaaataacaatgttattatggtttctggtctgaagatatt 

10 gga.\atcagcttttgtgggggtttggaacatatattcacattctctgcac 
ttgaaagcctgagacagctcgaagagttaacgataatgaattgctggtca 

ATGAAAGTGATTGTGAAGAAGGAAGAAGATGAATATGGAGAGCAGCAAAC 

AACAACAACAACGAAGGGGACTT<mxnTCTTCTTCTTCTTCTT(nTCTT 

(m"CTTCTTCTTCTTCTTCTCCTCCTTCTTCTTCTAAGAAGGTTGTGGTC 

15 TTTCCTTGTCTAAAGTCCATTGTATTGGTCAATCTACCAGAGCTGGTAGG 
ATTCTTCTTGGGGATGAATGAGTTCCGGTTGCCTTCATTAGATGAACTTA 
TCATCGAGAAATGCCCAAAAATGATGGTGTTTACAGCTGGTGGGTCCACA 
GCTCCCCAACTCAAGTATATACACACAAGATTAGGCAAACATACTATTGA 
TCA-\GAATCTGGCCTTAACTTTCATCAGGTATATATGTTTCTTTAATTGG 

20 CATCATCTAATTAAGAAAGATATCATTCCTGCCAAGTAAATTTACTTCAA 
ACACATTCACACTGGTTTCAGTCTAAGTTTATGTTGTTCTAGGAAGGCCA 
AAATGGGAAAGCAAGATAGGGAAAAATAGTGTATTTCAGTGGAAAGGGTA 
TTTTAGGTATTTTCTGTCAAAAGTTGTTATTGCAGGCTTTTTAGTACCTG 
GAATCGTGTGTGGGAGGAGCATTATTATTCTGATTTGCTTGTTTCTTTAT 

25 CATTTTTTCTTAGCCTCTCGAACAGCTAGAAACCCTTTTAATCTTTTGAT 
TTT.\AATGACAAAATTTTTCCCTGTTACTCTATTTGATTGTTGTTCTTCA 
TGGTTCTAAGTGAGTTATTGGCTCATCTGTTACTTCTTTTGATTGTTATT 
TTCATAGCATGTTAGTCACTTGAATCAAGCTTTTTCATTTTCAACCAGGG 
CAAAAGGTCAAAAGTAACCTACTTTATGAGATCAAAAACAGCAACCCATC 

30 GGATAACTTTTAGTTGGAGTTAATAGTTACAATTACCATTGTGATTAATA 
ATTATAATATCCTGTATTAATTCATAAAAATTGGTACAGCACATATATGA 
CATTTCAAAGGTTTTTGTTTGACATATATATGCCTCTGGCGTTTTCTTTA. 
TTGGACTTGCAGACCTCATTCCAAAGTTTATACGGTGACACCTTGGGCCC 
TGCrACTTCAGAAGGGACAACTTGGTCTTTTCATAACTTGATTGAATTAG 

35 ATGTGAAATTTAATAAGGATGTTAAAAAGATTATTCCATCCAGTGAGTTG 
CTGCAACTGCAAAAGCTGGAAAAGATAAATATAAACAGTTGTGTTGGGGT 
AGAGGAGGTATTTGAAACTGCATTGGAAGCAGCAGGGAGAAATGGAAATA 
GTGGAATTGGTTTTGATGAATCGTCACAAACAACTACCACTACTCTTGTC 
AATCTTCCAAACCTTAGAGAAATGAACTTATGGGGTCTAGATTGTCTGAG 

40 GTATATATGGAAGAGCAATCAGTGGACAGCATTTGAGTTTCCAAAACTAA 
CAAGAGTTGAAATTAGTAATTGCAACAGTTTAGAACATGTATTTACTAGT 
TCCATGGTTGGTAGTCTATCGCAACTCCAAGAGCTACATATAAGTCAGTG 
CA.AACTTATGGAGGAGGTGATTGTTAAGGATGCAGATGTTTCTGTAGAAG 
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AAGACAAAGAGAAAGAATCTGATGGCAAGATGAATAAGGAGATACTTGCG 
TTACCTAGTCTAAAGTCCCTGAAATTAGAAAGCTTACCATCTCTTGAGGG 
GTTTAGCTTGGGGAAGGAGGATTTTTCATTCCCATTATTGGATACTTTAA 
GAATTGAGGAATGCCCAGCAATAACCACCTTCACCAAGGGAAATTCCGCT 

5 ACTCCACAACTAAGAGAAATAGAAACAAGATTTGGCTCGGTTTATGCAGG 
GGAAGACATCAAATCCTCTATTATAAAGATCAAACAACAGGTAAATCAGA 
TCATTGTTGGTTTAATAATTCTTAAACTACATTTGAAAAGTTTCATGTAA 
GTTTTTTATTATTGTCAAAAGCCGCAACCrATATTTTCAACrrTATATTT 
ATGTACTTTATGCAGGATTTCAAAAAAGCCCAGGACTCTATTTAATGTGA 

10 AGTAAATACTAGAAGAGGTAAATTCTATTTACATGTCTCCTGATTGCCTA 
TTAATTAATGGCCTTTCAGTTCATGGTTTTTGGATGTATTCTTCATGATG 
ACGTGAATGTTTAAATACCCCACTAGTTAATTGTTAGGTTGAATGTTGAT 
GACCAAAGGACTATATGTCGGGAAGAATATTCAAGGAAAGAATTGTTCAT 
CATATGAAGGGCATTAAATTAAGAAGAACATGGATGCTATGAAGATGTTG 

15 GGA-AAATATATGAATCAAATAACAAGCTACTCACTTATCTAAGTTTGTTG 
GTTGAGGATGTTGATTTTAATATTTCAAATTCATTGGTATCATTATATGG 
GTTTATCAGTAGTGTTAATGGGATAATGAGCAACTTAACCTTAAATTATG 
CTGTTGGTAAATGTTGGACTCAAGTATGGAAAATTAGGAATAACTTGTGA 
AAA.-\TATATGCAAAAGTAGGATTGAGATTTTCAATGAAAAAAATTATGAA 

20 ACTATACTACTATAGTATATAAATAAATTCAACTTACTGTTGGGTATATT 
GGAAGCACATATCATGAAAGTAACTAGAAGCAGAATTTGTTCCCATCTTC 
ATCTACTTATAGTTTCCATTTCTTACTTGTAAAAATCTGATTAAACTTTA 
GAGTTATTTCTATTTTTTACCAACCAAAATTTTCATATAAAGGCCACAAG 
T (SEQ ID NO:107) 

25 

RG2J deduced polypeptide sequence (SEQ ID NO:108) 

MSDPTGWGAIINPIAQTALWLTDHVGYMISCRKYVIUJMQMKMTELNTSRISAEEH 

ISRNTRNHLQIPSQIKDWLDQVEGIRANVANFPIDVISCCSLRIRHKLGQKAFKITEQI 

ESLTRQNSLIIWTDEPVPLGRVGSMIASTSAASSDHHDVFPSREQIFRKALEALEPVQ 

30 KSimALWGMGGVGKTTMMKKLKEVVEQKKTCNnVQVVIGEKTNPIAIQQAVADY 
I^IEIJCENTKEARADKIJlKia'EADGGKNm.VILDDVWQFFDI^IGI^PLPNK^^ 
NFKVLLTSRDSHVCTLMGAEANSILNIKVLKDVEGKSLFRQFAKNAGDDDLDPAH 
GIADSIASRCQGLPIAIKTIALSLKGRSKSAWDVALSRLENHKIGSEEWREVFKISYD 
NLQDEVTKSIFLLCALFPEDFDIPIEELVRYGWGLKLFIEAKTIREARNRLNNCTERL 

35 RETNLLFGSHDFGCVKMHDVVRDFVLHMFSEVKHASrVNHGNMSEWPEKNDTSN 
SCKRISLTCKGMSKFPKDmYPNLULKLMHGDKSLCFPENFYGKMEKVQVISYDKL 
MYPLLPSSLECSTNVRVLHLHYCSLRMFDCSSIGNLLNMEVLSFANSNIEWLPSTIG 
NUCKLRLLDLTNCKGUUDNGVLKNLVKLEELYMGVNRPYGQAVSLTDENCNEM 
VEGSKXLIALEYEIJXYNAQVKNISFENLKRFKISVGCSLHGSFSKSRHSYENTLKL 

40 AIDKGELLESRMNGLFEKTEVLCLSVGDMYHLSDVKVKSSSFYNLRVLWSECAEL 
KHLFTLGVANTLSKLEHLKVYKCDNMEELIHTGGSEGDTITFPKLKLLYLHGLPNL 
LGLCLNVNAIELPKLVQMKLYSIPGFTSIYPRNKLEASSLLKEEVVIPEEUVEKCGSI 
EELFNIDIJ)CASVIGEEDNNSSlJl^fI^rV^NSMKIJlEWRIKGADNSRPIJRGFQVVE 
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KmTRCKRFTNVFTPITTNFDLGALLEISVDCRGNDESDQSNQEQEQIEILSEKETLQE 
ATDSISNVVFPSCIJ^SFH>fLQKULNR\TCGVEVVFEIESESPTSRELVTTHHNQQQP 
VIFPNLQHLDUlGMDNMmVWKCSNWNKFFTU»KQQSESPFHNLT^^ 
LFSPDvlAELI^NIJCKVNIKWCYGIEEWSNRDDEDEEMTTFrSTHTTmFPHlJ^ 

5 TLSFLENLKCIGGGGAKDEGSNEISFNNTTATTAVLDQFELSEAGGVSWSLCQYAR 
EISffiFCNAl^SVIPCYAAGQMQKLQVLTVSSCNGlJCEVFETQUlRSSNKNNEKSGC 
DEGNGGIPRVNNNVmiJSGUaLHSFCGGLEHIFTFSALESUlQLpLTIMNCWSMK 
VIVKKEEDEYGEQQTTTTlXGTSSSSSSSSSSSSSSSSPPSSSKKVVviFPCI^ 
ELVGFFLGMNEFRLPSLDEUIEKCPKMMVFTAGGSTAPQLKYIHTRLGKHTIDQES 

10 GLNFHQDIYMPLAFSLLDLQTSFQSLYGDTLGPATSEGTTWSFHNUELDVKFNKD 
VKKHPSSEIXQLQKIJEKINmSCVGVEEVFETALEAAGRNGNSGIGFDESSQTTTTTL 
VNIJ»^^JU^MNLWGU)CIJlYIWKSNQWTAFEFPKLTRVmSNCNSLEHVF^SSMV^^ 
I^QLQELHISQCXIJ^VIVKDADVSVEEDKEKESDGKMNKEII^PSIiCSU^ 
PSLEGFSLGKEDFSFPLLDTLRIEECPAITTFTKGNSATPQLREIETRFGSVYAGEDIKS 

15 smOKQQDFKKAQDSI.CEVNTR 

RG2K polynucleotide sequence (SEQ ID NO: 109) and (SEQ ID NO:110) 

TGGGATTCCATATATAAAAACATATATTTTTATAAAGTGGGATTCCATTG 
TTTATATAGATTTTTATTCACCAATAGACAATAGATTAAAAAAAGATATA 

20 AAAACATGTCGGCTTTTGACTAAAAATATAGATTTTTATGAATAGAATAT 
TCAATTTGCTTAACTCGTTTAAAAAAAATGAAAAAGATGTCGATATAAAA 
TCTCATATGGGCCTTCTTTACCATTCAAATAGTAAAATAGTAAAAGATAC 
TTGTTTGGGGCATGAACTGACCATAGTCAAACCCATACAAAATCAAACGA 
ATCCCACATGGATGATGACGATGGGGTCGCAGTAAATGTGTTTTGGTCCT 

25 TTTTTTTCGAGAGAACAGAAGCTTCTGCTCTTCATCTTCTTTAGATTTTG 
GGGATTTTCTGGTTTCAGGGGTTTGTGAGTGGAAACTAAATTGAAGCAAA 
AAAGTATGGTATAATTGGTTGCTAGTGAAATTGATGCTTTCTATTACTAT 
CATCTTTAAAATTGTCAAAACATTATGTATTAAATTATGAGATCGAAAGT 
GGTCTATGGGCCAAAGGTAATACAAGCTTACTCAATGAAATGAATCTAGG 

30 ATGCATCATGCATGTATTGGTTAGATTAAAGATTTTCATCAAATTTCCTT 
TATCAAATTGTTGTATACCATGTTATGTAGGTGCTACCACAAGCCATAAC 
ATCGAGCAATGGAGTGTATTACTGGCATCTTTAGCAACCCGTTTGCTCAG 
TGTCTCATCGCTCCTGTGAAAGAACACCTTTGCCTTCTGATTTTCTATAC 
ACAATATGTAGGGGATATGCTTACTGCAATGACGGAGTTGAATGCTGCAA 

35 AAGACATTGTTGAAGAGCGGAAGAATCAAAACGTAGAAAAATGTTTTGAG 
GTTCCAAACCATGTCAACCGTTGGTTGGAAGATGTTCAAACAATCAACAG 
AAAAGTGGAACGTGTTCTTAACGATAATTGCAATTGGTTCAATCTATGTA 
ATAGGTACATGCTCGCAGTGAAAGCCTTGGAGATAACTCAGGAGATCGAT 
. CATGCCATGAAACAACTCTCTCGGATAGAATGGACTGATGATTCAGTTCC 

40 TTTGGGAAGAAATGATTCCACAAAGGCATCCACCTCTACACCATCAAGTG 
ATTACAATGACTTCGAGTCAAGAGAACACACTTTTAGGAAAGCACTTGAA 
GCACTTGGATCCAACCACACATCCCACATGGTAGCCTTATGGGGGATGGG , 
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TGGAGTTGGGAAGACCACGATGATGAAGAGGCTGAAAAATATTATTAAAG 
AAAAGAGGACGTTTCATTATATTGTTTTGGTGGTTATAAAGGAAAATATG 
GATCTCATTTCCATCCAGGATGCTGTAGCAGATTATCTGGATATGAAGCT 
AACAGAAAGCAATGAATCAGAAAGAGCCGATAAACTTCGTGAAGGGTTTC 

5 AGGCCAAATCAGATGGAGGTAAGAATAGGTTCCTCATAATACTGGATGAT 
GTATGGCAATCTGTTAATATGGAAGATATTGGTTTAAGTCCTTTTCCGAA 
TCAAGGTGTCGACTTCAAGGTCTTGTTGACCTCGGAAAACAAAGATGTTT 
GTGCAAAAATGGGAGTTGAAGCTAATTTAATTTTCGACGTGAAATTCTTA 
ACAGAAGAAGAAGCACAAAGTTTGTTTTATCAATTTGTAAAAGTTTCTGA 

10 TACCCACCTTGATAAGATTGGAAAAGCTATTGTAAGAAACTGTGGTGGTC 
TACCCATTGCCATCAAAACCATAGCCAATACTCTTAAAAATAGAAACAAG 
GATGTATGGAAGGATGCACTTTCTCGTATAGAGCATCATGACATTGAGAC 
AATTGCACATGTrGTTTTTCAAATGAGCTACGACAATCTCCAAAACGAAG 

15 ATTCCTACTGAGGAATTGGTGAGGTATGGATGGGGATTGAGAGTATTTAA 
TGGAGTGTATACTATAGGAGAAGCAAGACACAGGTTGAACGCCTACATCG 
AGCTGCTCAAGGATTCTAATTTATTGATTGAAAGTGATGATGTTCACTGC 
ATC.\AGATGCATGATTTAGTTCGTGCTTTTGTTTTGGATACGTTTAATAG 
ATTCAAGCATTCTTTGATTGTTAACCATGGTAATGGTGGTATGTTAGGGT 

20 GGCCTGAAAATGATATGAGTGCCTCATCTTGCAAAAGAATTTCATTAATA 
TGC.\AGGGCATGTCCGATTTTCCTAGAGACGTAAAGTTTCCAAATCTCTT 
GATTTTGAAACTrATGCATGCAGATAAGTCTTTGAAGTTTCCTCAAGACT 
TTTATGGAGAAATGAAGAAGCTTCAGGTTATATCATACGATCACATGAAG 
TATCCCTTGCTTCCAACATCACCTCAATGCTCCACCAACCTTCGTGTGCT 

25 TCATCTTCATCAATGCTCATTGATGTTTGATTGCTCTTCTATTGGAAATC 
TGTTGAATCTGGAAGTGCTCAGCTTTGCTAATTCTGGTATTGAGTGGTTG 
CCTTCCACAATCGGAAATTTGAAGGAGCTAAGGGTACTAGATTTGACAAA 
TTGTGATGGTCTTCGTATAGATAATGGTGTCCTAAAGAAATTGGTGAAAC 
TTG.AAGAGCTTTATATGAGAGTTGGTGGTCGATATCAAAAGGCCATTAGC 

30 TTCACTGATGAAAACTGCAATGAAATGGCAGAGCGTTCAAAAAATCTTTC 
TGCATTAGAATTTGAGTTCTTCAAAAACAATGCTCAACCAAAGAATATGT 
CATTTGAGAATCTTGAACGATTCAAGATCTCAGTGGGATGTTATTTTAAG 
GGAGATTTCGGTAAGATCTTTCACTCTTTTGAAAACACGTTGCGGTTGGT 
CACCAACAGAACTGAAGTTCTTGAATCTAGGCTTAATGAGTTGTTTGAGA 

35 AAACAGATGTTCTTTATTTAAGTGTGGGAGATATGAATGATCTTGAAGAT 
GTTGAGGTAAAGTTGGCACATCTTCCTAAATCCTCTTCCTTCCACAATTT 
AAGAGTCOTATCATTTCTGAGTGTATAGAGTTGAGATACCTTTTCACAC 
TTGATGTTGCAAACACTTTGTCAAAGCTTGAGCATCTTCAAGTTTACGAA 
TGCGATAATATGGAAGAAATCATACATACAGAGGGTAGAGGAGAAGTGAC 

40 AATTACATTCCCAAAGCTGAAGTTTTTATCATTGTGTGGGCTACCAAATC 
TGTTGGGTTTGTGTGGTAATGTGCACATAATTAATCTACCACAACTCACA 
GAGTTGAAACm'AATGGCATTCCAGGTTTCACAAGCATATATCCTGAAAA 
AGATGTTGAAACATCTAGTTTGTTGAATAAAGAGGTAAATGTGTTTTATG 
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TTAATACAATACAATCTTTTCAATTAACCGTTTCAAAATATATTGTATGA 
TTTATTTTTGTTTGGATGGGGTTATTAATGGGTGATTATTTCTCAGGTTG 
TAATTCCTAATTTGGAGAAACTTGATATTAGTTATATGAAGGATTTGAAA 
GAGATATGGCCTTGTGAATTAGGGATGAGTCAGGAAGTTGATGTTTCTAC 

5 GTTGAGAGTGATTAAAGTAAGCAGTTGTGATAATCTTGTGAATCTATTCC 
CGTGCAATCCTATGCCATTGATACATCACCTTGAAGAGCTTCAAGTGATA 
TTTTGTGGTTCCATTGAAGTGTTATTCAACATTGAGTTGGATTCTATTGG 
TCAAATTGGAGAAGGCATCAACAATAGCAGCTTGAGAATCATCCAATTGC 
AGAACTTAGGGAAGCTAAGTGAGGTGTGGAGGATAAAAGGTGCGGATAAC 

10 TCTAGTCTTCTCATCAGTGGCTTTCAAGGTGTTGAAAGCATTATCGTTAA 
CAAATGCAAGATGTTTAGAAATGTATTCACACCTACCACCACCAATTTTG 
ATCTGGGGGCACTTATGGAGATTCGGATACAAGATTGTGGAGAAAAGAGG 
AGAAACAACGAATTGGTAGAGAGTAGCCAAGAGCAAGAGCAGGTATGGCT 
TTCAATTTCACTTTCTTACTTAATGAAGGATTAAGCTCCTGCTTTTTGAA 

15 TAAAAAGTGGATGAATGACTAAATTCGGGAATGCCACCCGGAAAGTTATC 
AACCATTTAGCTACACCATTTTTTGAACTAATGTTGCAATAAATGCATAA 
TATAATTAAAAAATGGTCATTGATAAATGTAAACCAACCTTTTTTATTTA 
TTAAAATGTCTACAATAAATGATTTTCTTTATTATATATCATTTTATAAC 
AAT.AAGCTTAAAGATGTTTAAATAGCCAATGTCAGTTATAGATCGTAACT 

20 AATTTTTTATTAACTAGTTTTAGTTAAGATATCACTCATTATTATTTTTA 

TAGAAAAAAGACAAGATTGGCTAATCCTCATAAGAATTTGGAAGATTTAA 
GCA-^WW^TATAGAGCTTTTCCAAACATAGCCAATAGTTTCTTTTGCAGGTC 

AATTTCCATTAATTAAAAAAAAGTTCAAGCCATTTTGTAGTTGGCACCTG 

25 CAA.AATGGTAGTTTGCACCTGCGGAAATCACCTTTCACCATTTCGCATCT 
ATGACTTGTGAAAATGTTAATTTGTGAAATGGTCATGTGCACCTCATGAG 
AAATACGAAATGGTCAGTAATATGACTTTTTTATATAAATATGATGGTGG 
CATATATTTATAGGAAAATATAGCTGCACGATATTAATTAATAGTGAAAT 
TAGTTAACTGTATACGATAAGTATACAAAATTTATATGTATGAAGTATAC 

30 TCA.ATTTAGGACGACTCGGGCAATGAAATCATCATTTAATAGGAGCAATG 
AAATCATTTTCGAAAAATGTTTACAAATGAATAAAATATTAAATTAAACT 
TAAAACATmGTTAGTAGTTTGAAATTTACAAACTGAAATTTGTTGTAT 
TTATTAACATTTATAAATGTTGTACTATGATTTTTTCCTTGTTTGCAAAT 
ATTCCTTAAAAATCCACCTAAAATCAAAATAATTAATCTTTTTCAAGTTG 

35 AAAAATGAAAATCGTATGATATAACCGTGTATGGATGTGGAATTATATAT 
CAGTTACTAATTACATTTTTTGTTGGGATATATGTGCGCAGATTGATATT 
GCAATCCCATTCACTCTCACACACTCTTTCCAAAACCTCCGTAAACTTGC 
TTTGGAAAAGTATGAAGGAGTGGAGGTGGTGTTTGAGATAGAGAGTCCAA 
CAAGTAGAGAATTGATAACAATTCACCATAATCAACAACCACTACTTCCC 

40 AACCTTGAGTTATTGGATATAAGTTTTATGGACAGCATGAGTCATGTATG 
GAAGTGCAACTGGAATAAATTCTTCATTCTTCAAAAACAACAGTCAGAAT 
CCCCATTCTGTAATCTCACAACCATACATATTCAATATTGCCAAAGCATT 
AAGTACTTGTTTTCAACTCTCATGGCAAAACTTCTTTCCAACCTAAAGAA 
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GGTCGAGGTAAGAGAGTGTCATGGTATTGAAGAAGTTGTTTCGAACAGAG 

ATGATGAAGATGAGGAAAAGACTACATTTACATCTACATCTTCTGAAAAA 

AGCACTAATTTGTTCCCTCGTCTTGAATCrCTCGCrCTTTATCAACTTCC 

AAATCTCAAGTGTATTGGTGGTGGTGGTTCTGCCAACAGTGGGAACAATG 

AAATATCTCTTGATAATTCCACTACTACTACTTCTTTTGTTGATCAATCT 

AAGGTATGTTTTTTTTTTTNGTTNCCCTT (SEQroNO:109) 

Sequence gap 

CCTCCCTAATAATACATGTTATGCACACTATACTAACATATTAGACACGT 

AAAGGATAAATGCTATGCCTCATATAATACGTTATATTTATAATCTTTAA 

ACAATCAAATTTATTAAACAAATAACTAAGTGTGAGCAAAGGCAGGTACC 

CGACTAAATTGCCCAAAACCAGTCTGGTGGTTCGTGGAATGTTGGGCCAG 

GTCGTTAAAACGTCTACACACCGGTTCTTTAAATCACAGATCCGCTTCTC 

ATACTGTGAACCCGGTTTTAATrTTAAAAGAAAATTTCATTATAAAGTAA 

ATGACTTAAACCATTACAAACAACAAAAATTTACCATTACAATGTTGGAC 

TATCATTATTTGCAACATAAAACTGAAAATACACATATTTCCTTCTGATA 

TCAGCATGAGTGGCTGGTTGGCTAACCCAAAAATCCATGCATTGTAGATG 

TGTGTTACAACACATAGTATCAATGAAAGGCATATTTTTAGGCTAGAATT 

TAACAATCTGTAATAATATTCCCTAAAACTAATATCATCATCAACCAACT 

AATATAAAACCATTGGGTTCGTCATTTTAGGTACAAAACATAGATTTTTC 

TAAGCITGTTGTATTTAAACATATGCTTTCrAAACTTAATTGATTTTGCA 

TTCCAAAATTTTAGGTTGTAAAGTGGTATGTCAmGTTGTCTTTTCAAC 

ATTAATTGTACAAAAACCAAAACTACATAATTGATGTAGATATCATAACA 

ATrGTCTTATTTAGTATATAAAAACTAAATTTTGAATTGAATTTCTTATA 

CAA.\AGTTGTGTCTATGTATACATGTTTATGTAGGTAATAGACAATTAGT 

CTCTGTTAAGTATATGGAGTTTAATTTTTAGACTAATTTTTCATGTGTTG 

CAGTTTTATCAGGCAGGTGGCGTTTnTGGACGTTATGCCAATACTCCAG 

AGAGATAAATATAAGGGAGTGTTATGCATTGTCAAGTGTAATTCCATGTT 

ATGCAGCAGGACAGATGCAAAATGTTCAAGTGCTGAATATATACAGGTGC 

AACTCAATGAAGGAGTTATTTGAAACTCAAGGGATGAACAACAACAATGG 

TGACAGTGGTTGTGATGAAGGAAATGGTTGTATACCAGCAATTCCAAGAC 

TAA.ATAACGTTATTATGCTACCCAATCTAAAGATATTGAAGATTGAAGAT 

TGTGGTCATCTGGAACATGTATTCACATTCTCTGCACTTGGAAGCCTGAG 

ACAGCTCGAAGAGTTAACGATAGAGAAATGCAAGGCAATGAAAGTGATAG 

TGA.AGGAAGAAGATGAATATGGAGAGCAAACAACAAAGGCATCTTCGAAG 

GAGGTTGTGGTCTTTCCTCGTCTCAAGTCCATTGAACTGGAAAATCTACA 

AGAGCTCATGGGTTTCTACTTAGGGAAGAATGAGATTCAGTGGCCTTCAT 

TGGATAAGGTTATGATCAAGAATTGCCCAGAAATGATGGTGTTTGCACCT 

GGTGAGTCCACAGTTCCCAAGCGCAAGTATATAAATACAAGCTTTGGCAT 

ATATGGGATGGAGGAGGTACTTGAAACTCAAGGGATGAACAACAATAATG 

ATGACAATTGTTGTGATGATGGAAATGGTGGAATTCCAAGACTAAATAAC 

GTTATTATGTTTCCAAATATAAAGATATTGCAAATCAGCAATTGTGGCAG 

TTTGGAACATATATTCACATTCTCTGCACTTGAAAGCCTGATGCAGCTCA 

AAGAGTTAACAATAGCGGATTGCAAGGCAATGAAAGTGATTGTGAAGGAG 
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GAATATGATGTAGAGCAAACAAGGGTATTGAAGGCTGTGGTATTTTCTTG 
TCTAAAGTCCATTACACTATGCCATCTACCAGAGTTGGTGGGTTTCTTCT 
TGGGGAAGAATGAGTTCTGGTGGCCTTCATTGGATAAGGTTACCATCATT 
GATTGCCCACAAATGATGGGGTTCACACCTGGTGGGTCAACAACTTCCCA 
5 CCTCAAGTACATACACTCAAGCTTAGGCAAACATACTCTTGAATGTGGCC 
TTAATTTCAAGTCACAACTACTGCATATCATCAGGTATAATTATTATTCT 
TTNACACCATCTAATTATGGAATCATGACGCTAATTACAGTATTAAACAC 
(SEQIDNOtllO) 

10 RG2K deduced polypeptide sequence (SEQ ID NO:lll) 

MECITGIFSOTFAQOJAPVKEHLrmFYTQYVGDMLTAMTELNAAKDIVEERK 
NQNVEKCFEVPNHVNRWLEDVQTINRKVERVLNDNCNWFNLC^fRYMLAVKAL 
EITQEIDHAMKQLSRIEWTDDSVPLGRNDSTKASTSTPSSDYNDFESREHTFRKAL 
EALGSNHTSHMVALWGMGGVGKTTMMKRLKNIIKEKRTFHYIVLVVIKENMDL 

15 ISIQDAVADYLDMKLTESNESERADKLREGFQAKSDGGKNRFLULDDVWQSVN 
MEDIGLSPFPNQGVDFKVLLTSENKDVCAKMGVEANLIFDVKFLTEEEAQSLFY 
QFVKVSDTHLDKIGKAIVRNCGGLPIAIKTIANTLKhfRNKDVWKDALSRIEHHD 
lETLAHVVFQMSYDNLQNEEAQSIFLLCGLFPEDFDIPTEELVRYGWGLRVFNGV 
YTIGEARHRLNAYIEUJCDSNLLIESDDVHCIKMHDLVRAFVLDTFNRFKHSIJV 

20 NHGNGGMLGWPENDMSASSCmSUCKGMSDFPRDVKFPNUJLKLMHADKS 
LKFPQDFYGEMKKLQVISYDHMKYPLLPTSPQCSTNLRVLHLHQCSLMFDCSSI 
GNLLNLEVLSFANSGIEWLPSTIGNLKELRVLDLTNCDGLRIDNGVLKKLVKLEELY 
MRVGGRYQKAISFTDENCNEMAERSKNLSALEFEFFKNNAQPKNMSFENLERFKIS 
VGCYFKGDFGKIFHSFENTUU.WNRTEVLESRIJSnEU'EKTDVLYLSVGDMN 

25 VEVKLAHLPKSSSFHNLRVLIISECIELRYLFTLDVANTLSKLEHLQVYECDNMEEn 
HTEGRGEVTITFPKLKFLSLCGLPNLLGLCGNVHIINLPQLTELKLNGIPGFTSIYPEK 
DVETSSLLNKEWIPNLEKLDISYMKDLKEIWPCELGMSQEVDVSTLRVIKVSSCDN 
LVNTJT<:NPMPUHHLEELQWCGSffiVIJJNffiIJ)SIGQIGEGINNSSIJmQLQN^^ 
I^E^'WRIKGADNSSmSGFQGVESIIVNKCKMFR^^VFTPTTTNFDLGALMEIRIQDC 

30 GEKRRNNELVESSQEQEQ 

RG2L polynucleotide sequence (SEQ ID NO:112) 

GGA.\GACACAATGATGCAAAGACTGAAGAAGGTTGCCAAAGAAAATAGAA 
TGTTCAGTTACATGGTCGAGGCAGTTATAGGGGAAAAGACAGACCCAATT 

35 GCTATTCAACAAGCTGTAGCCGATTACCTTCGTATACAGTTCAAAGAAAG 
CACTAAACCAGCAAGAGCTGATAAGCTTCGTGAATGGTTCAAGGCCCACT 
CTGNAGACGGTAAGAATAAGTTCCTCGTAATATTTGATGACGTCTGGCAG 
TCCGTTGATCTGGAAGATATTGGNTTAAGTCCTTTTCCAAATCAAGGTGT 
CGACTTCAAGGTCTTGTTGACTTCACGAGACGAACACGTTTGCACAATGA 

40 TGGGGGTTGAAGCTAATTCAGTTATTAATGTGGGACTTCTAACTGAAGTA 
GAAGCACAAAGTCTGTTCCAGCAATTTGTAGAAACTTTTGAGCCCGAGCT 
CTGTAAGATAGGAGAAGTTATCGTAAGAAAGTGTTGCGGTCTACCTATTG 
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CCATCAAAACCATGGCGTGTACTCTAAGAAATAAAAGAAAGGATGCATGG 
AAGGATGCACTTTCACGTATAGAGCACTATGACATTCGTAGTGTTGCGCC 
TAAAGTCTTTGAAACAAGCTATCACAATCTCCAAGACAGGGAGACTAAAT 
CCGTGTTTTTGATGTGTGGTTTGTTTCCTGAAGACTTCAATATTCCTACC 

5 GAGGAGTTGATGAGGTATGGATGGGGCTTAAAGCTATTTGACAGAGTrTA 
TACAATTAGAGAAGCAAGAACCAGGCTCAACACCTGCATTGAGCGACTTG 
TGCAGACAAATTTGTTAATTGAAAGTGATGATGTTGGGTGTGTCAAGATG 
CATGATCTGGTGCGTGCTTTTGTTTTGGGTATGTATTCTGAAGTCGAGCA 
TGCTTCAATTGTCAACCATGGTAATATGCATGGGTGGACTAAAAATGATA 

10 TGAACGACTCTTGCAAAACAGTTTCTTTAACATGCGAGAGTGTGTCTGAG 
TTTCCAGGAGACCTCAAGTTTCCAAACCTAAAGCTTTTGAAACTTATGCA 
TGGAGATAAGATGCTAAGGTTTTCTCAAGACTTTTATGAAGGAATGGAAA 
AGCTCCAGGTAATATCATACCATAAAATGAAGTATCCATTGCTTCCCTCG 
TCACCTCAATGCTCCACCAACCTTCGAGTGCTTCATCTTCATCGGTGTTC 

15 ATTACGGATGCTTGATTGCTCTTGTATCGGAAATTTGACGAATCTGGAAG 
TGTTGAGCTTCGCTAATTCTGGCATTGAACGGATACCTTCAGCAATCGGA 
AATTTGAAGAAGCTTAGGCAACTTGATCTGAGAGGTCGTTATGGTCTTTG 
TATAGAACAGGGTGTCTTGAAAAATTTGGTCGAACTTGAAGAACTTTATA 
TTGGAAATGCATCTGCGTTTAGAGATTATAACTGCAATGAGATGGCAG 

20 

RG2L deduced polypeptide sequence (SEQ ID NO:113) 

EDTMMQRLKKVAKENRMFSYMVEAVIGEKTDPIAIQQAVADYLRIQFKESTKPAR 

ADKLREWFKAHS7DGKNKFLVIFDDVWQSVDLEDIGLSPFPNQGVDFKVLLTSRDE 

HVCTMMGVEANSVINVGLLTEVEAQSLFQQFVETFEPELCKIGEVIVRKCCGLPIAI 

25 KTMACTLRNKRKDAWKDALSRIEHYDIRSVAPKVFETSYHNLQDRETKSVFLMCG 
LFPEDFNIPTEELMRYGWGLKLFDRVYTIREARTRLNTCIERLVQTNLLIESDDVGC 
VKlvIHDLVRAFVLGMYSEVEHASIVNHGNMHGWTKNDMNDSCKTVSLTCESVSEF 
PGDlJaTNIJXIJa.MHGDKMIJlFSQDFYEGMEKLQVISYHmKYPLLPSSPQCST 
mJlVLHLHRCSLRMLDCSaGNLTNLEVLSFANSGIERIPSAIGNLKKLRQLDLRGR 

30 YGLCIEQGVLKNLVELEELYIGNASAFRDYNCNEMA 

RG2M polynucleotide sequence (SEQ ID NO:114) 

GGGGAAGACACAATAGATGCAAAGGCTGAAGAAGTTGCCAAAGAAAAGAG 
AATGTTCAGTTATATCATTGAGGCGGTTATAGGGGAAAAGACAGACCCCA 

35 TTTCCATTCAGGAAGCTATATCATATTACCTTGGTGTAGAGCTCAATGCA 
AATACTAAGTCAGTAAGAGCTGATATGCTTCGTCAAGGGTTCAAGGCCAA 
ATCTGATGTAGGTAAGGATAAATTCTTAATAATACTCGACGATGTATGGC 
AGTCTGTTGATTTGGAAGATATTGGATTAAGTCCATTTCCAAATCAAGGT 
GTTAACTTCAAGGTCCTGTTAACATCACGAGACCGACATATTTGCACTGT 

40 GATGGGGGTTGAAGGTCATTCGATTTTTAATGTGGGACTTCTCACAGAAG 
CAG.AATCAAAAAGATTGTTCTGGCAGTTTGTAGAAGGTTCTGATCCTGAG 
CTCCATAAGATAGGAGAAGATATTGTAAGTAAGTGTTGTGGTCTACCCAT 
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TGCCATTAAAACCATGGCATGTACACTTAGAGATAAAAGTACGGATGCAT 
GGAAGGATGCACTGTCTCGTTTAGAGCATCATGACATTGAAAATGTTGCC 
TCTAAAGTTTTTAGAGCGAGCTATGACCATCTCCAAGACGAGGAGACTAA 
ATCCACTTTTTTTCTATGTGGATTGTTTCCAGAAGATTCCAATATTCCTA 

5 TGGAGGAGTTGGTGAGGTATGGGTGGGGATTGAAATTATTTAAAAAAGTG 
TATACCATAAGAGAAGCAAGAACTAGGCTCAACACTTGCATTGAGCGGCT 
CATCTATACCAATTTGTTGATAAAAGTTGATGATGTTCAGTGCATCAAGA 
TGCATGATCTCATCCGTTCTTTTGTTTTGGATATGTTTTCTAAAGTTGAG 
CATGCTTCGATTGTCAACCATGGTAATACGCTAGAGTGGCCTGCAGATNA 

10 TNTGCACGACTCTTGTAAAGGGCTTTCATTAACATGCAAGGGTANATGTG 
AGTTTTGTGGAGACCTNAANTTTCCAACCCTAATGATTTTAAAACTTATG 
CATGGAGATAAATCGCTAAGGTTT 

RG2M deduced polypeptide sequence (SEQ ID N0:115) 

15 GEDTIDAKAEEVAKEKRMFSYIIEAVIGEKTDPISIQEAISYYLGVELNANTKSVRAD 
MLRQGFKAKSDVGKDKFLIILDDVWQSVDLEDIGLSPFPNQGVNFKVLLTSRDRHI 
CTVMGVEGHSIFNVGLLTEAESKRLFWQFVEGSDPELHKIGEDIVSKCCGLPIAIKT 
MACTLRDKSTDAWKDALSRLEHHDffiNVASKVFRASYDHLQDEETKSTFFLCGLFP 
EDSNIPMEELVRYGWGUaJTOCWTIREARTMJ^TCIEmYTNUJKVDDVQCIKM 

20 HDLIRSFVLDMFSKVEHASIVNHGNTLEWPAD??HDSCKGLSLTCXG?CEFCGDL?F 
PTLMILKLMHGDKSLRF 

RG2N polynucleotide sequence (SEQ ID NO:116) 

AGGTAAAATCCATAACCCTAAATGTTGGTACGCTCATATATCAAATTGCG 

25 TGTTTTGTTGAATGAAAAAAGCATGCTCAAAAAACCAGTGTAAGGCACGG 
TATATGACATATTTATAGTTACTGATAACAAATTATGATAATTTTGGGTT 
TACRTAAGTTAGGATTCGTACTTCAACCAAATGTAATAGTTTTTGTGAGT 
CTATCTATGTATTTGGGGAATCACATTAGCAACGGGATTGTACTAGTAAT 
TCG.\AAAAGTCTTTTAAATAATTTTTCTGTTTATAATTTATGAATAGTTT 

30 TAGCGACATCTAATATTAAATAGAATGTATCTGATATTGAATTAATGTCC 
TTAATGTGAACATAGACCTTTTCCATTTACTAATGCCTAATTATTAGTTT 
CTAATCAATAAATTTTAATTTCTGTTTTATGCTTCTAAGACAATAAAAAT 
CCATGATTTACCTTTAAATATTAACAAAAATGACCATAAATAAATAAAAA 
ATTAGGATACCAAACCCCCCCGCCATGCCCAATGTCTAAATATTCTTGAT 

35 GCTTTTGCTTTTCCCT(nTITCCnTGTTAGTCTATTATTCTGGAGAGTTT 

GAGAGAGTTTCATACAAGAAAATTTCAAGAAGAAAGCAAAGGTCCAGGTA 
TTCTCTTTTCTTAATTATGTATTAACTTACAAGCATTTTTTACACGATCC 
ATGGTTTTTTGTGTATGTTTTTCAAATTGAAACTAGATTGGGACTTTTGC 
CCTTGATGATTCATAAGATATTGCATGGAGTTGAGATTGTGTAAGAAAAG 

40 TGGTGAATAGAAAGAGCAAGTGAATCCAGATATAGTATTGGTAATATATG 
ATGATGAGATAGAGATATGTTAAAACTGGCTAGAAAATTGTTTTAATTTG 
AAATTTAGGTKGTTGAATTTGAAAGATACCAAGCTAATAACTAATTAGTT 
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ATGCTAAWTAGTTATAAAGAACAACAAACTCTTAGTTTTTTTTTTCATGA 

TTTTCAACCTCTTTGTACCAAACTAAATTATAGCAAAATTGAATATCATT 

CTCTGCAATCAATCTTAACTTTTGTTATTATCATCATGTCTAAAATTGCC 

ACAAGTTTATTTTCAAAGTCATATTGGATTATGAAAGGACTATTTTTACC 

AATTACATCTTTACTTTATGGGCCAAAGCTAATACAATCCGACTAAACTA 

AAGGAATATGGGATGCATATAGTTTGCTTCCCGATTATAGATTTCTATCT 

AATITGTCTATTGTACTAATTTAGGTGCCACCACAAGTAAATTTGTTAAA 

TGGATATCGTTAATGCCATTCTTAAACCAGTTGTCGAGACTCTCATGGTA 

CCCGTTAAGAAACACATAGGGTACCTCATTrCCTGCAGGCAATATATGAG 

GGAAATGGGTATCAAAATGAGGGGATTGAATGCTACTAGACTTGGTGTCG 

AAGAGCATGTGAACCGGAACATAAGCAACCAGCTTGAGGTTCCAGCCCAA 

GGCAGGGGTTGGTATGAAGAAGTAGGAAAGATCAATGCAAAAGTGGAAAA 

TTTTCCTAGCGATGTTGGCAGTTGTTTCAATCTTAAGGTTAGACACGGGG 

TCGGAAAGAGAGCCTCCAAGATAATTGAGGACATCGACAGTGTCATGAGA 

GAACACTCTATCATCATCTGGAATGATCATTCCATTCTTCTAGGAAGAAT 

TGATTCCACGAAAGCATCCACCTCAATACCATCAACCGATCATCATGATG 

AGTTCCAGTCAAGAGAGCAAACTTTCACAGAAGCACTAAACGCACTCGAT 

CCT.AACCACAAATCCCACATGATAGCCTTATGGGGAATGGGCGGAGTGGG 

GAAGACGACAATGATGCATCGGCTGAAAAAGGTTGTGAAAGAAAAGAAAA 

TGTTTAATTTTATTGTTGAGGCGGTTGTAGGGGAAAAAACAGACCCCATT 

GCTATTCAATCAGCTGTGGCAGATTACCTAGGTATAGAGCTCAATGAAAA 

AACTAAACCAGCAAGAACTGAGAAGCTTCGTAAATGGTTTGTGGACAATT 

CTGCTGGTAAGAAGATCCTAGTCATACTCGACGATGTATGGCAGTTTGTA 

GATCTGAATGATATTGGTTTAAGTCCTTTACCAAATCAAGGTGTCGACTT 

CAAGGTGTTGTTGACATCACGAGACAAAGATGTTTGCACTGAGATGGGAG 

CTG.AAGTTAATTCAACTTTTAATGTGAAAATGTTAATAGAAACAGAAGCA 

CAAAGTTTATTCCACCAATTTGTAGAAATTTCGGATGATGTTGATCGTGA 

GCTCCATAATATAGGAGTGAATATTGTAAGGAAGTGTGGCGGTCTACCCA 

TTGTCATCAAAACCATGGCGTGTACTCTTAGAGGAAAAAGCAAGGATGCA 

TGGAAGAATGCACTTCTTCGTTTAGTGAACTACAACATTGAAAATATAGT 

GAATGGAGTTTTTAAAATGAGTTACGACAATCTCCAAGATGAGGAGACTA 

AATCCACCTTTTTGCTTTGTGGAATGTTTCCCGAAGACTTTAATATTCCT 

ACCGAGGAGTTGGTGAGGTATGGATGGGGGTTGAAATTATTTAAAAAAGT 

GTATACTATAGGAGAAGCAAGAATCAGGCTCAACACATGCATTGAGCGGC 

TCATTCATACAAATTTGTTGATTGAAGTTGATGATGTTAGGTGCATCAAG 

ATGCATGATCTTGTCCGTGCTTTTGTTTTGGATATGTATTCTAAAGTCGA 

GCATGCTTCCATTGTCAACCATGGTAATACACTAGAGTGGCATGTGGATA 

ATATGCACAACTCTTGTAAAAGACTTTCATTAACATGCAAGGGTATGTCT 

AAGTTTCCTACAGACCTCAAGTTTCCAAACCTCTCGATTTTGAAACTTAT 

GCATGAAGATATATCATTGAGGTTTCCCAAAAACTTTTATGAAGAAATGG 

AGAAGCTTGAGGTTATATCCTATGATAAAATGAAATATCCATTGCTTCCC 

TCATCACCGCAATGCTCCGTCAACCTTTGCGTGTTTCATCTCCATAAATG 

CTCGTTAGTGATGTTTGACTGCTCTTGTATTGGAAATCTGTCGAATCTAG 
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AAGTGCTTAGCTTTGCTGATTCTGCCATTGACCTGTTGCCTTCCACAATC 
GGAATTTTGAAGAAGCTAAGGCTACTGGATTTGACAAATTGTTATGGTCT 
TTGTATAGCTAATGGTGTCTTTAAAAAATTGGTCAAACTTGAAGAGCTCT 
ATATGACAGTGGTTAATGGAGGAGTTCGAAAGGCGATCAGCCTCACTGAG 

5 GATAACTGCAATGAGATGGCAGAACGTTCAAAAGACCTTTCTGCATTAGA 
ACTTGAGTTCTTTGAAAACAATGCTCAGCCAAAGAATATGTCATTTGAGA 
AGCTACAACGATTCCAGATCTCAGTGGGGTGCTATTTATATGGAGCTTCC 
ATAAAGAGCAGGCACTCGTATGAAAACACATTGAAGTTGGTTATTGACAA 
AGGTGAATTATTTGAATCTTGAATGAACGGCCTGTTTAAGAAAACAGAGG 

10 TGTTATGTTTAAGTGTGGGAGATATGAATGATCTTGAAGATRTTGAGGTT 
AAGTCATCCTCACAACYTCTTCAATCTTCTTCGTTCAACAATTTAAGAGT 
CCTTGTCGTTTCAAAGTGTGCAGAGTTGAAACACTTCTTCACACCTGGTG 
TTGCAAACACTTTAAAAAAGCTTGAGCATCTTGAAGTTTACAAATGTGAT 
AATATGGAAGAACTCATACGTAGCAGGGGTAGTGAAGAAGAGACGATTAC 

15 ATTCCCCAAGCTGAAGTTTTTATCTTTGTGTGGGCTACCAAAGCTATCGG 
GTTTGTGCGATAATGTCAAAATAATTGAGCTACCACAACTCATGGAGTTG 
GAACTTGACGACATTCCAGGTTTCACAAGCATATATCCCATGAAAAAGTT 
TGAAACATTTAGTTTGTTGAAGGAAGAGGTAAATATAAATTTTTAATGCT 
AATACATTACAAAGGATCTTTTCAGTTAAATCTTTCAAAATATATTGTAA 

20 TTTGATTGTATGGGGTATTATTGTTGGATGGGACTATTAATAAATGATTA 
TCTTGCAGGTTCTGATTCCTAAGTTAGAGAAACTGCATGTTAGTAGTATG 
TGGAATCTGAAGGAGATATGGCCTTGCGAATTTAATATGAGTGAGGAAGT 
TAAGTTCAGAGAGATTAAAGTGAGTAACTGTGATAAGCTTGTGAATTTGT 
TTCCGCACAAGCCCATATCTCTGCTGCGTCATCTTGAAGAGCTTAAAGTC 

25 AAGAATTGTGGTTCCATTGAATCGTTATTCAACATCCATTTGGATTGTGC 
TGGTGCAACTGGAGATGAATACAACAACAGTGGTGTAAGAATTATTAAAG 
TGATCAGTTGTGATAAGCTTGTGAATCTCTTTCCACACAATCCCATGTCT 
ATACTGCATCATCTTGAAGAGCTTGAAGTCGAGAATTGTGGTTCCATTGA 
ATCGTTATTCAACATTGACTTGGATTGTGCTGGTGCAATTGGGCAAGAAG 

30 ACAACAGAAGCAGCTTAAGAAACATCAAAGTGGAGAATTTAGGGAAGCTA 
AGAGAGGTGTGGAGGATAAAAGGTGGAGATAACTCTCGTCCCCTTGTTCA 
TGGCTTTCAATCTGTTGAAAGCATAAGGGTTACAAAATGTAAGAGGTTTA 
GAAATGTATTCACACCTACCACCACAAATTTTAATCTGGGGGCACTTTTG 
GAGATTTCAATAGATGACTGCGGAGAAAACAGGGAAAATGACGAATCGGA 

35 AGAGAGTAGCCATGAGCAAGAGCAGGTAAGGATTTCAATTTCACTTTCKT 
ACTTAATTAATGATTAAGCTCCTGCTTTTTRAATAAAAAAGGGACAAACC 
ATTTCATGACTTAATGTAGCAATACAAGTCATGTATAAGAGTGACCAACT 
CTTTTTTATTTATAAAATGACTACAAAATA'ri'l'riTTTCATTAGAGATCA 
TGTATAAATGTGACTAATTTTTCATCACCTAACTTTAGTTGATAAATCTT 

40 TATAAATGTCACTAGTTACTTTTCAGTAAAATAACAAATTTAATAAATTA 
TCAACAAAAAGCATCAACTAAAAAAATCCCACAACCCGTAATAATTTAAA 
ATAAAAGGATTTAACATCTAATACGAACAATTTTTTTTCTAAACATGATT 
TGGACCAAATATCACCAGCAACTCAAGTTTGGAATCGATTCAGCTTAAAA 
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CTTGACCARCATAATTAGATAGATGAGAGTTGAAGCTAAAGTGCCTATAT 

AAGTTCGTTTCATCTTTmCTTGATCTTGATAGCAAGTTGAATSATTTT 

CTTCTTCAAAATTGATAAAAATCTACATTATAAAGAGACTAGCTTGAAAA 

AAAATGGTCTAGGTGGGTCTTGGGTCTGGTAGATGAAGATGGAAGGGAGA 

GTAGATTTCAAAGACACAAACACATCTTCATTTTATTTATTTATTTATTA 

TTATTATTTTTTGATATCTTGCTCATATTTGTTACAGATATGTGAGGTCT 

ATTAATCTTTTTAAATATATAAAAAATAAATACA TAAAT GAGAAAATTAA 

ATAAAGAATAAATTAATAAGGGCACAATAGTCTTnTTGGTAAGACAAGG 

ACCAAAAGCGCAACAAAAGTAAACAGTAGGGACCATCCGATTTAAAAAAT 

TAATTAGGGACCAAAAACATAAATTCCCCCAAACCATAGGGACCATTCGT 

GTAATTTACTCTTGCTTTTCGTTTTGTTCATATTTGGGTAACTATTTTTT 

TTGTACATATCTAGGTAACGAACTTGTTGAAAGTGTTCACATCTACGATG 

TGACCTACTACAACCGATCATAATGGTCATATATGAACACTTCCAACAAG 

TTTGTTATCTAGGTGTGTACAAAAAAACGATAGTTACCATGATGTGAACA 

TACCAAAAAATTAATTACCTTAGCAAGTTATTTTCCCATTTAGGTTGTAT 

GGA.\ACAGTTCCGTGAGACCGTGACTTGGATGGTAGATAAATTTAGTAAA 

CTT AACCCTTC AATTAACCTACCTTTTTCTTATTAACTCAATTTCAAGCT 

AAATTCTGATTCTTGTTTGAAAGTAAGTTGCATCTTTATGTTTGTATTAT 

CTTGTTGCATAGGATCCTTAGCATCTTTTAATAATTTATTTGAAGGTGAA 

AGATCCAACTATTTTTAATCTGTTGGCATTTTCCATCATTTGCAACTGTT 

TCTTGAAAAAAA::TACCTAAAATCAAAATAACCATTTTCATATCCAAAA 

TTATAAGAGAGAATTGTTAACGGACATGGAATCATAAATCATTAACACAG 

1TCAGTACACAGGTTGCTAATTACATTTCTTGCTGTGCAGATTGAAATTC 

TATCAGAGAAAGAGACATTACAAGAAGCCACTGGCAGTATTTCAAATATT 

GTATTCCCATCCTGTCTCATGCACTCTTTTCATAACCTCCATAAACTTAA 

CTTGAACAGAGTTGAAGGAGTGGAGGTGGTGTTTGAGATAGAGAGTGAGA 

GTCCAACAAGTAGAGAATTGGTAACAACTCACCATAACCAACAACAACCT 

ATTATACTTCCCAACCTCCAGGAATTGATTCTATGGAATATGGACAACAT 

GAGTCATGTGTGGAAGTGCGGCAACTGGAATAAATTCTTCACTCTTCCAA 

AAGAACAATCAGAATCCCCATTCCACAACCTCAGTAACATACATATTTAT 

GAATGCAAAAGCATTAAGTACTTGTTTTCACCTCTCATGGCAGAACTTCT 

TTCCAACCTAAAGCATATCGAGATAAGAGAGTGTGATGGTATTGAAGAAG 

TTGTTTCAAAAAGAGATGGTGAGGATGAAGACATGACTACATCTAC:::: 

: : :GC ACACAACCACCACTTTTTCCCTCATCTTGATTCTCTCACTCTAAA 

GCA-\CTGAAGAATCTGAAGTGTATTGGTGGAGGTGGTGCCAAGGATGAGG 

GGAGCAATGAAATATCTTTCAATAATACCACTGCAACTACTGCTGTTCTT 

GATCAATTTGAGGTATGCTTTGTACATATTCAATTATTTATTTAATTTCC 

TTGTTAATTTCCTTTTTTCrrTGCAATATTCTATGAAAAAAATCACCAAA 

TCACAAATAAGAGATTTAAACTTTTATTTCACACCCATGCGGACTCAAGA 

ATGGGATTTGGAGGCATATAAAGTTACATTCATTTGAACAAGTATTACCA 

TTTATTTGTTATTTATCATTTTCATATCATTTACTGATAACATTTCTTTT 

TTACTTTTCTAATTAGAAAAGGTCCACATGTCTAATTAGGTTTTCCATTC 

TATGTGAATCCTCTATTCTGTCTGTAATCAAGCATCTTAGATTATTTATC 
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CATTTTCATAATTGTGTTTATATTGACAGTTTTTTTCTTTTTATAGTTGT 

AATTGCAACCTGTCATATWTTMWWKKCWWWATKYWMWWARTAATACATTT 

TATACCCWCTATACTAAGATA 

5 RG2N deduced polypeptide sequence (SEQ ID N0:117) 

U}KTTMMHRlJCKV\aCEKKMFNFIVEAWGEKTDPLAIQSAVADYmffiLNEKT^ 
RTEKIJ«WFVDNSAGKKILVnJ)DWQFVDI>IDIGLSPLPNQGVDFKVLLTSRDKD 
VCTEMGAEVNSTFNVKMIJnETEAQSLFHQFVEISDDVDREIJINIGVNIVRKCGGLPI 
VKTMACTLRGKSKDAWKNALIJRLVNYNIENIVNGVFmSYDNLQDEEmSTF^^ 
10 CGMFPEDF^^PTEELVRYGWGUa.FKKVYTIGEAMRIJ^CIERUHT^«.I^ 

CIK^IHDLVRAFVLDMYSKVEHASIVNHGNTI^WHVDN^dHNSCKRLSLTCKGMSK 
FPTDIJaTNLSIIja.MHEDISLRFPKNFYEEMEKLEVISYDKMKYPLLPSSPQCSVNL 
CVFHIJ^KCSLVMFDCSCIG^^.SNLEVLSFADSAroIXPSTIGILKKLRLLDLTNCYGL 
CIANGVFKKLVKLEELYMTWNGGVRKAISL 

15 

RG20 polynucleotide sequence (SEQ ID N0:118) 

TTGTAAAACGACGGCCAGTCGAATCGTAACCGTTCGTACGAGAATCGCTG 
TCCTCTCCTTCATTTGAATCATGATATTTGAATATCGATACTTTTGACTG 
TAGCTTTTGGGTCGATTTTTTAGCAAGATACATAACTGGCCAAACCCATT 
20 GGCTATTTTAGCCCAAAATATGAAATGGACTGGATTGTTTTTTTCCTTTC 
TAACACGCACACATCTGGCGATCAGTATCACTCCATTATGAAGACCTAGT 
CAAATTCATTAACGTTCAGTCGTTCCTTCAAAGTTTCAAAGTTCCAACTT 

TCTGCGACGAAGGAGAGCTTGGTCAGAGGGCTGTGATTCTTGAGTCTTGA 

25 CCTCCGAATCTAGCTGGATTATTTTCGACACACCAGACCACGTATCAGGT 
TGCTCATCCCGAAATACTGCTTTGCAAACTGTTGTATCATCGCCTAGGAA 
ATTAAGTTTCTTTTTTGGCTCTGTTACTGAATCAGTAGCTTTGCAAC^ 
CTCATTATAAGCTGATCCATATTTTACATATCTTTTGAAGAATAATAGGT 
ACTGACTTTACCTTTCTGATGAGAGCGATTTAAGAGATACCTCTGTAAAA 

30 TCCATTTTTGTGAAGGGATCTGGGTTAGTTTTTAAAGGATTTGCTACAAC 
AGTATCCCACAAACGATCTATTTCCCATTTNACTCATCCGCTCAAGATCT 
ATCCACCTTTATATATGTTAATTGGGAGTCTTCCATGGTGCAATGAATCT 
AGGATGCATTTAGAAGCCCAATCCATTACAAGTTTTCATCCAATTTCATG 
TGACAAGTTGTTGGTTACTATGTAGGTACTTCCACAATTAAGAATTTCCA 

35 GCAATGGATGTTGTTAATGCCATTCTTAAACCAGTTGCCGAGACACTTAT 
GGAACCTGTTAAGAAACATCTAGGCTACATCATTTCCAGCACAAAACATG 
TGAGGGATATGAGTAACAAAATGAGGGAGTTGAACGCTGCAAGACATGCT 
GAAGAAGACCACTTGGACAGGAACATAAGAACTCGTCTTGAGATTTCAAA 
TCAAGTTAGGAGTTGGTTAGAAGAAGTAGAAAAGATCGATGCAAAAGTAA 

40 AAGCCCTTCCTAGTGATGTCACCGCTTGTTGCAGTCTCAAGATCAAACAT 
GAAGTCGGAAGGGAAGCCTTGAAGCTAATTGTGGAGATTGAAAGTGCCAC 
AAGACAACACTCTTTGATCACCTGGACTGATCATCCCATTCCTCTGGGAA 
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AAGTTGATTCCATGAAGGCATCGATGTCCACAGCATCAACCGATTACAAT 

GACTTTCAGTCAAGAGAAAAAACTTTTACTCAAGCATTGAAAGCACTTGA 

ACCAAACAACGCTTCCCACATGATAGCGTTATGTGGGATGGGTGGAGTGG 

GGAAGACCACAATGATGCAAAGACTAAAAAAAGTTGCTAAACAAAATAGA 

ATGTTCAGTTATATGGTTGAGGCAGTTATAGGGGAAAAGACGGACCCAAT 

TGCTATTCAACAAGCTGTAGCGGATTACCTTCGTATAGAGTTAAAAGAAA 

GCACTAAACCAGCAAGAGCTGATAAGCTTCGTGAATGGTTCAAGGCCAAC 

TCTGGAGAAGGTAAGAATAAATTCCTTGTAATACTTGATGACGTCTGGCA 

GTCTGTTGATCTAGAAGATATTGGTTTAAGTCCTTTTCCAAATCAAGGTG 

TCGACTTCAAGGTCTTATTGACTTCACGAGACGAACATGTTTGCACAGTA 

ATGGGAGTTGGATCTAATTCAATTCTTAATGTGGGACTTCTAATAGAAGC 

AGAAGCACAAAGTTTGTTCCAACAATTTGTAGAAACTTCTGAGCCCGAGC 

TCCATAAGATAGGAGAAGATATTGTAAGGAAGTGTTGCGGTCTACCTATT 

GCCATCAAAACCATGGCATGTACTCTTAGAAATAAAAGAAAGGATGCTTG 

GAAGGATGCACTTTCGCGTATAGAGCACTATGACCTTCGCAATGTTGCGC 

CTAAAGTCTTTGAAACGAGCTACCACAATCTCCATGACAAAGAGACTAAA 

TCAGTGTTTTTGATGTGTGGTTTGTTTCCGGAAGACTTCAATATTCCTAC 

TGAGGAGTTGATGAGGTATGGATGGGGATTAAAGATATTTGATAGAGTCT 

ATACATTTATAGAAGCAAGAAACAGGATCAACACCTGCATTGAGCGACTG 

GTGCAGACAAATTTGTTAATTGAAAGTGATGATGTTGGGTGTGTCAAGAT 

GCATGATCTGGTCCGTGCTTTTGTTTTAGGTATGTATTCTGAAGTAGAGC 

ATGCTtCAGTTGTCAACCATGGTAATATACCTGGATGGACTGAAAATGAT 

CCGACTGACTCTTGTAAAGCAATTTCATTAACATGCGAGAGTATGTCTGG 

AAACATTCCAGGAGACTTCAAGTTTCCAAACCTAACGATTTTGAAACTTA 

TGCATGGAGATAAGTCGCTAAGATTTCCACAAGACTTTTATGAAGGAATG 

GAAAAGCTCCAGGTTATATCATACGATAAAATGAAGTATCCAATGCTTCC 

CTTGTCTCCTCAATGCTCCACCAACCTTCGAGTGCTTCATCTCCATGAAT 

GTTCATTAAAGATGTTTGATTGCTCTTGTATTGGAAATATGGCGAATGTG 

GAAGTGTTGAGCTTTGCTAATTCTGGCATTGAAATGTTACCTTCCACTAT 

CGGAAATTTAAAGAAGCTAAGGTTACTTGATTTAACAGATTGTCATGGTC 

TTCATATAACACACGGTGTCTTTAACAATTTGGTCAAACTTGAAGAGTTG 

TATATGGGATTTTCTGATCGACCTGATCAAACTCGTGGTAATATTAGCAT 

GACAGATGTCAGCTACAATGAATTAGCAGAACGTTCAAAAGGCCTTTCTG 

CATTAGAGTTCCAGTTCTTTGAAAACAATGCCCAACCAAATAATATGTCG 

TTTGGGAAACTTAAACGATTCAAGATCTCAATGGGATGCACTTTATATGG 

AGGATCAGATTACTTTAAGAAAACGTATGCTGTCCAAAACACATTGAAGT 

TGGTTACTAACAAAGGTGAACTATTGGACTCTAGAATGAACGAGTTGTTT 

GTTGAAACAGAAATGCTTTGTTTAAGTGTTGATGATATGAATGATCTTGG 

TGATGTTTGTGTGAAGTCCTCACGTTCTCCTCAACCTTCTGTGTTCAAAA 

TTCTAAGAGTCTTTGTCGTTTCCAAGTGTGTTGAGTTGAGATACCTTTTC 

ACA.ATTGGTGTAGCCAAGGATTTGTCAAATCnTGAGCATCTTGAAGTTGA 

TTCATGTAATAATATGGAACAACTCATATGTATTGAGAATGCTGGAAAAG 

AGACAATTACATTCCTAAAGCTGAAGATTTTATCTTTGAGTGGGCTACCA 
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AAGCTTTCGGGTTTGTGCCAAAATGTCAACAAACTTGAGCTACCACAACT 
CATAGAGTTGAAACTTAAGGGCATTCCAGGGTTCACATGCATTTATCCGC 
AAAACAAGTTGGAAACATCTAGTTTGTTGAAGGAAGAGGTAGATATATGT 
TTTATGTTAATACAAGTTAAAAAATCTTTTTAACTAAAAGTTTCAGTATA 

5 TATATCTATATGTCTATAATTTGATTATATGATGTATTAGtGTTTGGATG 
TGGCTATTAAGGGATGATTATTTTGCAGGTTGTGATTCCTAAGTTGGAGA 
CACTTCAAATTGATGAGATGGAGAATTTAAAGGAAATATGGCATTATAAA 
GTTAGTAATGGTGAGAGAGTTAAGTTGAGAAAGATTGAAGTGAGTAACTG 
TGATAAGCTTGTGAATCTATTTCCACACAACCCCATGTCTCTGCTGCATC 

10 ATCTTGAAGAGCTTGAAGTCAAGAAATGTGGTTCCATTGAATCGTTATTC 
AACATCGACTTGGATTGTGTTGATGCCATAGGAGAAGAAGACAACATGAG 
GAGCTTAAGAAACATTAAAGTGAAGAATTCATGGAAGTTAAGAGAAGTGT 
GGTGTATAAAAGGTGAAAATAACTCTTGCCCCCTTGTTTCTGGCTTTCAA 
GCTGTTGAAAGCATAAGCATTGAAAGTTGTAAGAGGTTTAGAAATGTATT 

15 CACACCTACCACCACCAATTTTAATATGGGGGCACTTTTGGAGATATCAA 
TAGATGACTGTGGAGAATACATGGAAAATGAAAAATCGGAAAAGAGTAGC 
CAAGAGCAAGAGCAGGTATGGATTTCAATTTCACTTTCTTACTTACTTAA 
GGATTAAGCTTCTGTTTTTTTGAATAAAAAAGGGACATCTTCTAATAATG 
CACATCTTAAATTAAAAAGTATTTAATTGTTGCATAGCAGCGTATAACAT 

20 CTTCTAATAATTTATCTGAAGGTGAAAGATCCAACTACTTCTAATTTGTT 
AACAATTTCAATCATTTGCAAATGTTCCTTAAAAAATTAATTACCTGAAA 
TCAAAACAATCTTCTTCAAATCCAAAATTATGAGACAGAATTGAGAAGGG 
ATGTGAAATTATAAACCATTAACACAATTCCATGCTCACGTTACTAATTA 
CATTTCTTGTTGGGATATATATGTACAGACTGATATTTTGTCAGAGGAAG 

25 TGA-AATTACAAGAAGTCACTGATACTATTTCTAATGTTGTATTCACATCG 
TGTCTCATACACTCTTTTTATAACAACCTCCGTAAACTCAACTTGGAGAA 
GTATGGAGGAGTTGAGGTTGTGTTTGAGATAGAGAGTTCAACAAGTAGAG 
AATTGGTAACAACATACCATAAACAACAACAACAACAACAACCTATATTT 
CCC.AACCTTGAGGAATTATATCTATATTATATGGACAACATGAGTCATGT 

30 ATGGAAGTGCAACAACTGGAATAAATTTTTACAACAATCAGAATCCCCAT 
TCCACAACCTCACAACCATACACATGTCCGATTGCAAAAGCATTAAGTAC 
TTGTTTTCACCTCTCATGGCAGAACTTCTTTCCAACCTAAAGAGAATCAA 
TATTGACGAGTGTGATGGTATTGAAGAAATTGTTTCAAAAAGAGATGATG 
TGGATGAAGAA 

35 

RG20 deduced polypeptide sequence (SEQ ID NO:119) 

MD\'WAnJCPVAETIJ*4EPVKKHLGYnSSTKHVRDMSNKMRElJ>IAAI^ 
RNmTRLmSNQVRSWLEEVEKTOAKVKALPSDVTACCSLKIKHEVGREALKLIVEIE 
SATRQHSLITWTDHPIPLGKVDSMKASMSTASTDYNDFQSREKTFTQALKALEPNN 
40 ASHMIALCGMGGVGKTTMMQRLKKVAKQNRMFSYMVEAVIGEKTDPIAIQQAVA 
DYlJUEIJCESTKPARADKUlEWFKANSGEGia^KFLVIU)DWQSVDLEDIGLS^ 
NQGVDFKVLLTSRDEHVCTVMGVGSNSILNVGLUEAEAQSLFQQFVETSEPELHKI 
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GEDIVRKCCGLPIAIKTMACTLRNKRKDAWKDALSRIEHYDLRNVAPKVFETSYHN • 
IJIDKETKSVFLMCGLFPEDFNIPTEELMRYGWGLKIFDRVYTFIEARNRINTCIERL 
VQTNUJESDDVGCVKMHDLVRAFVLGMYSEVEHASVVNHGNIPGWTENDPTDSC 
KAISLTCESMSGNIPGDFKFPNLTIlJaj^HGDKSIJaTQDFYEGMEKLQ^ 

5 YPMU»IJSIKK:STNlJlVLHlJ^ECSIJ5MFDCSaGNMA^^V^VI^FANSGIEM 

LKKLRLLDLTDCHGLHITHGVFNNLVKLEELYMGFSDRPDQTRGNISMTDVSYNE 
LAERSKGLSALEFQFFENNAQPNNMSFGKLKRFKISMGCTLYGGSDYFKKTYAVQ 
NTUCLVTNKGELLDSRMNELFVETEMLCLSVDDMNDLGDVCVKSSRSPQPSVFKIL 
RVFWSKCVEUlYIJTlGVAKDl^NLEHLEVDSChn^MEQUCffiNAGKETITFl^^ 

10 LSLSGIPKI^GLCQNVNKI^IJPQUElJaj^GIPGFrCIYPQNfKI^TSSLIJCEEVVIPKL 
ETLQroEMENUCEIWHYKVSNGERVKLRKIEVSNCDKLVNLFPHNPMSLLHHLEEL 
EVKKCGSffiSLFNroU)CVDAIGEEDNMRSLRNIKVKNSWKLREVWCIKGENNSCPL 
VSGFQAVESISIESCKRFR^^VF^>TTTNFNMGAlJJEISroDCGEYMENEKSE^ 
EQTDILSEEVKLQEVTDTISNWF^SCUHSF^^mLRKLNLEKYGGVEVVFEIESSTS 

15 RELVTTYHKQQQQQQPIFPNLEELYLYYMDNMSHVWKCNNWNKFLQQSESPFHN 
LTTIHMSDCKSIKYLFSPLMAELLSNLKRINIDECDGI 

RG2P polynucleotide sequence (SEQ ID NO: 120) 

CCCATTGCTATTCAGGAAGCAGTAGCAGATTACCTCNGTATAGAGCTCAA 

20 AGAAAAAACTAAATCNGCAAGAGCTGATATGCTTCGTAAAATGTTAGTTG 
CCAAGTCCGATGGTGGTAAAAATAAGTTCCTAGTAATACTTGACGATGTA 
TGGCAGTTTGTTGATTTAGAAGATATCGGTTTAAGTCCTTTGCCAAATCA 
AGGTGTTAACTTCAAGGTCTTGCTAACATCACGGGATGTAGATGTTTGCA 
CTATGATGGGAGTCGAAGCCAATTCAATTCTCAACATGAAAATCTTACTA 

25 GATGAAGAAGCACAAAGTTTGTTCATGGAGTTTGTACAAATTTCGAGTGA 
TGTTGATCCCAAGCTTCATAAGATAGGAGAAGATATTGTAAGAAAGTGTT 
GTGGTTTGCCTATTGCCATCAAAACCATGGCCCTTACTCTTAGAAATAAA 
AGCAAGGATGCATGGAGTGATGCACTTTCTCGTTTAGAGCATCATGACCT 
TCACAATTTTGTGAATGAAGTTTTTGGAATTAGCTACGACTATCTTCAAG 

30 ACCAGGAGACTAAATATATCTTTTTGCTTTGTGGATTGTTTCCCGAAGAC 
TACAATATTCCTCCTGAGGAGTTAATGAGGTATGGATGGGGCTTAAATTT 
ATTTAAAAAAGTGTATACTATAAGAGAAGCAAGAGCCAGACTCAACACCT 
GCATTGAGCGGCTTATCCATACCAATTTGTTGATGGAAGGAGATGTTGTT 
GGGTGTGTAAAGATGCATGATCTAGCACTTGCTTTTGTTATGGATATGTT 

35 TTCTAAAGTGCAGGATGCTTCAATTGTCAACCATGGTAGCATGTCAGGGT 
GGCCTGAAAATGATGTGAGTGGCTCTTGCCAAAGAATTTCATTAACATGC 
AAGGGTATGTCTGGGTTTCCTATAGACCTCAACTTTCCAAACCTCACAAT 
TTTAAAACTTATGCATGGAGATAAGTTTCTCAAGTTTCCTCCAGACTTTT 
ATGAACAAATGGAAAAGCTTCAAGTTGTATCGTTTCATGAAATGAAATAT 

40 CCGTTTCTTCCCTCGTCTCCTCAATATTGCTCCACCAACCTTCGAGTTCT 
TCATCTCCATCAATGCTCATTGATGTTTGATTGCTCTTGTATTGGAAATC 
TGTTTAATCTGGAAGTGTTGAGCnTGCTAATTCTGGCATTGAATGGTTA 
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CCTTCCAGAATTGGAAATTTGAAGAAGCTAAGGCTACTAGATTTGACAGA 

TTGTTTTGGTCTTCGTATAGATAAGGGTGTCTTAAAAAATTTGGTCAAAC 
TTGAAGAGGTTTATATGAGAGTTGCTGTTCGAAGCAAAAAAGCCGGAAAT 
AGAAAAGCCATTAGCTTCACAGATGATAACTGCAATGAGATGGCAGAGCG 
5 TTC 

R62P deduced polypeptide sequence (SEQ ID NO:121) 

PIAIQEAVADYL?IElJCEKTKSARADMIJaCMLVAKSDGGKNm.VIIJ)DWQF^ 
EDIGIJSPIJ>NQGV^^FKVIXTSRDVDVCmMGVEANSILNMKILLDEEAQSLFMEFV 

10 QISSDVDPKLHKIGEDIVWCCCGlPIAIKTMALTlJmKSKDAWSDAl^RLEHHDlJ^ 
FVNEWGISYDYLQDQETKYIFLU:GLFPEDYMPPEEDVIRYGWGUaJTCKVYTIRE 
ARARIJ^CmmHTmjJ^GDWGCVKMHDlJVDVFVMDMPSKV 
MSGWPE^roVSGSCQmSLTCKGMSGFProLNFPNLTIIJKLMHGDKFUCFPPDFYEQ 
MEKLQVVSFHEMKYPFLPSSPQYCSTNLRVLHLHQCSLMFDCSCIGNLFNLEVLSF 

15 ANSGIEWU>SWGNUCKUILIJ)LTDCFGUUDKGVUCNLVKLEEVYMRVAVRSKKA 
GNRKAISFTDDNCNEMAERS 

RG2Q polynucleotide sequence (SEQ ID NO:122) 

TGGGGAAGACACAGTGATAGAAAARAAAAAGAATGTTGTGGAAAAGAGGA 

20 AAATGTTTGATTATGCTGTTGTGGCGGTTATAGGGGAAAAGACGGACCCT 
ATTGCTCTTCAGAAAACTGTTGCGGATTACTTGCATATTGAGCTAAATGA 
AAGCACTAAACTAGCAAGAGCAGATAAACTTTGCAAATGGTTCAAGGACA 
ACTCGGATGGAGGTAAGAAAAAGTTCCTCGTAATACTCGACGATGTTTGG 
CAATCTGTTGATTTGGAAGATATTGGTTTAAGTACTCCTTTTCCAAATCA 

25 AGGTGTCAACTTCAAGGTTTTGTTGACATCACGAAAGAGAGAAATTTGCA 
CAATGATGGGAGTTGAAGCTGATTTAATTCTCAATGTCAAAGTCTTAGAA 
GAAGAAGAAGCACAAAAGTTGTTCCTCCAGTTTGTAGAAATTGGTGACCA 
ATACCACGAGCTTCATCAGATAGGGGTACATATAGTAAAGAAGTGTTATG 
GTTTACCCATTGCCATTAAAACCATGGCTCTTACTTTAAGAAATAAAAGA 

30 AAGGATTCATGGAAGGACGCACTCTCTCGTTTAGAGGACCATGACACTGA 
AAATGTTGCAAATGCAGTTTTCGAGATGAACTACCGCAATCTACAAGATG 
AGGAGACCAAAGCCATTTTTTTGCTTTGCGGTTTGTTCCCCGAAGACTTT 
GATATTCCTACTGAGGAGTTGGTGAGGTATGGATGGGGCTTAAATCTATT 
TAA,AAAAGTGTATACCATAAGAAAGGCAAGAACGAGATCGCATACATGTA 

35 TTGAGCGACTCTTGGATTCAAATTTGTTGATTGAAAGTAACGATATTCGG 
TGCGTCAAGATACACGATCTGGTGCGCGCTTTTGTTTTGGATATGTATTG 
TAAAGTTGAGCATGCTTCAATTGTCAACCATGGTAATATGCGGACCGAAT 
ATA.\TATGGCTGACTCTTGCAAAACAATTTCATTAACATACAAGAGTATG 
TCTGGGTTTGAGTTTCCAGGAGACCTCAAGTTTCCAAACCTAACAGTTTT 

40 GAAACTTATGCANGGAGATAAGTCTCTAAGGTTTCCTCAAGACTTTTATC 
AATCAATGGAAAAACTTCGGGTTATATCATATGATAAAATGAAGTATCCA 
TTGCTTCCCTCATCACCTCAATGCTCCACTAACATCCGAGTGCTTCGTCT 
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CCATGAATGTTCATTAAGGATGTTTGATTGCTCTTGTATTGGAAAGCTAT 

TGAATTTGGAAGTCCTCAGCTTTTTTAATTCTAACATTGAATGGTTACCT 

TCCACAATCAGAAATTTAAAAAAGCTAAGGCTACTAGATTTGAGATATTG 

TGATCGTCTTCGTATAGAACAAGGTGTCTTGAAAAATTTGGTCAAACTTG 

AAGAACTTTATACTGGATATACATCAGCGTTTACAGA 

RG2Q deduced polypeptide sequence (SEQ ID NO:123) 

GEDTYIEKKKNWEKRKMFDYAWAVIGEKTDPIAI^KTVADYUnEU^EST^^ 

ADKLCKWFKDNSDGGKKKFLVILDDVWQSVDLEDIGLSTPFPNQGVNFKVLLTSR 

KREICTMMGVEADULNVKVLEEEEAQKLFLQFVEIGDQYHELHQIGVHIVKKCYG 

IJ»IAIKTMALTUINKRKDSWKDALSRLEDHDTENVANAVFEMNYRNLQDEETKAI 

FIXCGLFPEDFDIPTEBLVRYGWGLNU'KKVYTnaCARTRSHTCIER^ 

DmC\aaHDLVRAFVIJDMYCKVEHASIVNHG^MRTEYNMADSCKTISLTYKSMSG 

FEFPGDLKFPNLTVLKLM7GDKSLRFPQDFYQSMEKLRVISYDKMKYPLLPSSPQCS 

TNTRVLRLHECSLRMFDCSCIGKLIJ^LEVI^FFNS^IffiWLPSTIIWLIaaLRLLDLRYC 

DRLRIEQGVLKNLVKLEELYTGYTSAFTE 

RG2S polynucleotide sequence (SEQ ID NO:124) 

ATTTGGGGTTTTACATTTAATTTTTTGTGCATGAATGTGAAAATAGACTG 

CTTATTGATTCTTTGTGTTTCATTGAGTTGATTTTCATTATTACTACCTT 

ACAAATTGCTCAGTGATAGATTTCCATTAATTTGCTAATTCGGTTGCTTC 

TAAATATGTAGGAGCTACTAAAAGCAAAAATATCGAGCAATGTCGGACCC 

AACGGGGATTGCTGGTGCCATTATTAACCCAATTGCTCAGAGGGCCTTGG 

TTCCCGTTACAGACCATGTAGGCTACATGATTTCCTGCAGAAAATATGTG 

AGGGTCATGCAGACGAAAATGACAGAGTTGAATACCTCAAGAATCAGTGT 

AGAGGAACACATTAGCCGGAACACAAGAAATCATCTTCAGATTCCATCTC 

AAATTAAGGATTGGTTGGACCAAGTAGAAGGGATCAGAGCAAATGTGGAA 

AACTTTCCGATTGATGTCATCACTTGTTGTAGTCTCAGGATCAGGCACAA 

GCTTGGACAGAAAGCCTTCAAGATAACTGAGCAGATTGAAAGTCTAACAA 

GACAGCTCTCCCTGATCAGTTGGACTGATGATCCAGTTCCTCTAGGAAGA 

GTTGGTTCCATGAATGCATCCACCTCTGCATCATCAAGTGATGATTTCCC 

ATC.\AGAGAGAAAACTTTTACACAAGCACTAAAAGCACTCGAACCCAACC 

AAC.\ATTCCACATGGTAGCCTTGTGTGGGATGGGTGGAGTAGGGAAGACT 

AGA.\TGATGCAAAGGCTGAAGAAGGCCGCTGAAGAAAAGAAATTGTTTAA 

TTATATTGTTAGGGCAGTTATAGGGGAAAAGACGGACCCCTTTGCCATTC 

AAGAAGCTATAGCAGATTACCTCGGTATACAACTCAATGAAAAAACTAAG 

CCAGCAAGAGCTGATAAGCTTCGTGAATGGTTCAAAAAGAATTCAGATGG 

AGGTAAGACTAAGTTCCTCATAGTACTTGACGATGTTTGGCAATTAGTTG 

ATCTTGAAGATATTGGGTTAAGTCCTTTTCCAAATCAAGGTGTCGACTTC 

AAGGTCTTGTTGACATCACGAGACTCACAAGTTTGCACTATGATGGGGGT 

TGAAGCTAATTCAATTATTAACGTGGGCCTTCTAACTGAAGCAGAAGCTC 

AAAGTCTGTTCCAGCAATTTGTAGAAACTTCTGAGCCCGAGCTCCAGAAG 
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ATAGGAGAGGATATCGTAAGGAAGTGTTGCGGTCTACCTATTGCCATAAA ' 
AACCATGGCATGTACTCTTAGAAATAAAAGAAAGGATGCATGGAAGGATG 
CACTTTCGCGCATAGAGCACTATGACATTCACAATGTTGCGCCCAAAGTC 
TTTGAAACGAGCTACCACAATCTCCAAGAAGAGGAGACTAAATCCACTTT 
5 TTTAATGTGTGGTTTGTTTCCCGAAGACTTCGATATTCCTACTGAGGAGT 
TGATGAGGTATGGATGGGGCTTGAAGCTATTTGATAGAGTTTATACGATT 
AGAGAAGCAAGAACCAGGCTCAACACCTGCATTGAGCGACTGGTGCAGAC 
AAATTTGTTAATTGAAAGTGATGATGTTGGGTGTGTCAAGATGCATGATC 
TGGTCCGTGCTTTTGTTTTGGGTATGTTTTCTGAAGTCGAGCATGCTTCT 

10 ATTGTCAACCATGGTAATATGCCCGAGTGGACTGAAAATGATATAACTGA 
CTCTTGCAAAAGAATTTCATTAACATGCAAGAGTATGTCTAAGTTTCCAG 
GAGATTTCAAGTTTCCAAACCTAATGATTTTGAAACTTATGCATGGAGAT 
AAGTCGCTAAGGTTTCCTCAAGACTTTTATGAAGGAATGGAAAAGCTCCA 
TGTTATATCATACGATAAAATGAAGTACCCATTGCTTCCTTTGGCACCTC 

15 GATGCTCCACCAACATTCGGGTGCTTCATCTCACTAAATGTTCATTAAAG 
ATGTTTGATTGCTCTTGTATTGGAAATCTATCGAATCTGGAAGTGCTGAG 
CTTTGCTAATTCTCGCATTGAATGGTTACCTTCCACAGTCAGAAATTTAA 
AGAAGCTAAGGTTACTTGATCTGAGATTTTGTGATGGTCTCCGTATAGAA 
CAGGGTGTCTTGAAAAGTTTAGTCAAACTTGAAGAATTTTATATTGGAAA 

20 TGCATCTGGGTTTATAGATGATAACTGCAATGAGATGGCAGAGCGTTCTG 
ACAACCTTTCTGCATTAGAATTCGCGTTCTTTAATAACAAGGCTGAAGTG 
AAAAATATGTCATTTGAGAATCTTGAACGATTCAAGATCTCAGTGGGACG 
CTCTTTTGATGGAAATATCAATATGAGTAGCCACTCATACGAAAACATGT 
TGC.\ATTGGTGACCAACAAAGGTGATGTATTAGACTCTAAACTTAATGGG 

25 TTATTTTTGAAAACAAAGGTGCTTTTTTTAAGTGTGCATGGCATGAATGA 
TCTTGAAGATGTTGAGGTGAAGTCGACACATCCTACTCAGTCCTCTTCAT 
TCTGCAATTTAAAAGTTCTTATTATTTCAAAGTGTGTAGAGTTGAGATAC 
CTTTTCAAACTCAATCTTGCAAACACTTTGTCAAGACTTGAGCATCTAGA 
AGTTTGTGAATGCGAGAATATGGAAGAACTCATACATACTGGAATTTGTG 

30 GAGAAGAGACAATTACTTTCCCTAAGCTGAAGTTTTTATCTTTGAGTCAA 
CTACGGAAGTTATCAAGTTTGTGCCATAATGTCAACATAATTGGGCTACC 
ACATCTCGTAGACTTGATACTTAAGGGCATTCCAGGTTTCACAGTCATTT 
ATCCGCAGAACAAGTTGCGAACATCTAGTTTGTTGAAGGAAGAGGTAGAT 
ATATGTTCTTTATGTTAATACAATTTAAATAATATTTTCAACCAAATTTT 

35 CAT.\ATATATCTGTAATTTGATTGTATGATGTGTTATTGTTTATATGTGG 
CTATTAAGGGATGATTATTTTGCAGGTTGTGATTCCTAAGTTGGAGACAC 
TTC.AAATTGATGACATGGAGAACTTAGAAGAAATATGGCCTTGTGAACTT 
AGTGGAGGTGAGAAAGTTAAGTTGAGAGAGATTAAAGTGAGTAGCTGTGA 
TAAGCTTGTGAATCTATTTCCGCGCAATCCCATGTCTCTGTTGCATCATC 

40 TTG.AAGAGCTTAAAGTCAAGAATTGCGGTTCCATTGAATCGTTATTCAAC 
ATTGACTTGGATTGTGTCGGTGCAATTGGAGAAGAAGACAACAAGAGCCT 
CTT.\AGAAGCATCAACATGGAGAATTTAGGGAAGCTAAGAGAGGTGTGGA 
GGATAAAAGGTGCAGATAACTCTCATCTCATCAACGGTTTTCAAGCTGTT 
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GAAAGCATAAAGATTGAAAAATGTAAGAGGTTTAGCAATATATTCACACC 
TATCACCGCCAATTTTTATCTGGTGGCACTTTTGGAGATTCAGATAGAAG 
GTTGCGGAGGAAATGACGAATCAGAAGAGCAGGTAACGCTTTCAATTTAA 
CTTTCTTAAGTAATTAAGGACTAACCTCCTGTTTTTTGAATAATAAAGAG 
5 GTGGGATGACTAAACTTGGGCATCACAATTGCAACAAAATGTTACAAACC 
ATGAAACGTTCAAACCATTTCTTGAATTAAGGTTTCAATACAAGTCATTT 

TTAGAGATCATTATTATAATAGTAAGTTTAAAGCAATTTAAATTAGAACT 
AATTCTAACnTTAGCTAATAAATCGTTATAAATGTAAATAATTACTnTT 

10 AGTGAAATAAGCAACGGATTTAATAAGTTAACAACTTAAATGTCATTTCC 
TAACAAAAAAAACTATTTGGTTCAGAAGAACCGTAATTCAAGATAACTAA 
AATAAAAATATTTGACATTCACTAAGAGCATTTTTTTTTCTAAATATGAT 
TGCAAATGAATAAAACTTAAATTTATACAGAAAAGATTTTTATATATGTT 
ATACAAAATTTACAAATTGAAACTGGATATGTTAATTAACGGTTTATAAT 

15 TCTGGTATCACAAAGGGATATATAATAAAATATTATTTTCTGTAGTCATT 
TAT.AATTGTACTAGTTTATAACCCGTGGGAACCATGAGTTCTAAAATTAG 
TTAAACTTTCATAATAAAAATTTATAATTATTATTTATTTTAAATAAATT 
ATr.AATTAAGAGATGTATCAAAAATTTAAAGTTATTATAACTTCAAATTT 
AACATATAATTAGAAAATATATGATCATAACTTTCCGCAACTCTTCTTTT 

20 GTATTAAAATGCCCAGAGAAGCTCTTAGTAYATTTTCTAAATCAAAGTCA 
CAAAACTAATGAAGCATATAATTTTGTGAAAATCAATTAGCATTAGGTTT 
TAAGAGTCACCAAATTCAAAGAGTAATCCAATGCTTTCATTACCACTATG 
GAG-^VAAATATTTTCTTAGTTTAAATGAAATGAAAACAAACATTCAAACTA 
ATTGTTGCTTACTAAACCAAAGACCCATTACTTAGCCAAGAGTTTAACCA 

25 AAA.AAAATTACATTCATGTATCATTATTCATGACTAGATATATATGAACA 
TGA.\GGGAGTTTTTATAGAAAATATAATCATAGATATTCAACATAACTTC 
ATGGAATTCCTCAAAATAACCAAGTTATTCAAGAAATTACATCCAAGTCA 
ACC.AAAGAGAAGTTTAGCCTAGCATGGCTAAACTCAAGAAAATAAAATAA 
GGATTAGAAGTACCAAACATGTAGTAAGAATCACAGTAAAAGATGATGTT 

30 GTTCTTGATGTTCTTCTAAGTTCTTCAAGTCTCCAGTTGCTCCTAATAAT 
GCAAAGGAGAGCCATTAAATTCGTATGTATTGATCCCTTCAAAAGCTGCA 
CCA-ACCTCCCTTAAATAACACTCAAAGCAAAAATGACAAAATTGCCCCTG 
AAGGACCCTATGCGGGTGCCTTGCGCGGGTGGAGCTGAATACGAAAGGTC 
TTTGGTCTTTGTGAGGGTGATGCTGTGCGGGTTAGCTTGTCGCATGCTTC 

35 CGCGCGGTTCGCGCACATGTGCACAAGTGATGCATGGTGTGTACGTTCTT 
GAGTTTTGAGCCTCCGATGCTTAGTCCATTTGGCCCAATTCGAGTCCAAT 
CAGCTTATGACCCATTTTTCTTCAAGTTATCTTCAAGTTATCTTCAAGTT 
AAGCCCAAATTGCCTTCTCCAAATCATCCATAACTTCACAAAATCGCCCG 
TTCATCnTAATCCCGAATGCACAATTATTCrCCTGTCrTCCTTTTAAGCA 
40 AGATACCACCTTCTTCATGCTTCATCCATCAATAGTACACTTCATGTATC 
ATCTCTACTAGTTATTTAGTCCACAATCCTTATTGTCCTCCAAATTTAAT 
TATCTCATTTAGTTCCCGTTCCACTAGTTTCCTTAAAATTTGCAATTAAG 
CTCACACAAATATTAAGTACCTGAAATGGTCATAAAATAACAAAAAGGAA 
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AATATGCATGAAGATTAACTAAATGATGAACGAAATATGCTAAAATAGAC 
. TATAAAATGAAGTAAATAAAATGAAATTATCGCACTCCGACCACCCTTAT 
AGGCTTGTAGTCCATCCACCCTTCATTCCTTGTACCAATATGGGATGGAA 
ACATCATTAATTAAGCCAAAAAACTAACATATAAGGGGTGAGTGACAAAG 
5 GTA-AGTACTAAAGATGAAAATAATCCATTTTTYTTGTATATACACAACAC 
ACACATAGGGGCAGACGTAGGATTTCATAGTACAGATTGTTGGTGGCACA 
TAAGTGTTGCTGGTGACACliri'1'rrri'CTTTTACGTAGTGGCACAACAG 
TAG.AAAAAACGAIL\AATTCGAAATTTTTTACAATGTGTSTAAAAAAAAYA 
GTGGTTGTTGGTGCCACTATGGACACCAAAGTTGAACTGCCCCTGCGCGC 

10 RCACACACACACACATAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG 
ARAGWAWGRRRGAKAKARMCSMSYTTGGGATGTGATACTTCTTTTAGGAA 
AATGGAGTTATATCTTTGATATTGTATTTTTTTAATGTAATTTATATATT 
TAATCATTTTAGTTTATAAGTTTTATTTATTTTGATATGAAAAAAAAAGT 
CTTTTATACATTGGATTTAACATAAAAATCCAACAATATTAATCAAAAAG 

15 ACC.\MACATGTGGACAMWTATGTATATAAWTAATTCACAATAGTCTTTAG 
GAATAGNATTATATATATAATTAATTCTCAATGGTCTTAGGAATAGTAAG 
TTCTTATATTTCAAACTTTNGCCACAATTCTTTGKTTACTTWGACACTTY 
CCTCTCTCTAATTATATATATATATATATATATATATATATATATACACA 
CACACACACACACACTAGATGTGTGCCCGCGCAAAGCAGTGACGTNNNGG 

20 AGA.\NACTTTCTTAAGCATAAATAATTATTATATTTTTTATTGGGTATTA 
TAT.\ATAAAAAATTACAACTTTTAAATAAAATATTTATGTTTATACTTTA 
TATTTATATTGCTTGTATACTATrAATATAATAAATTAATATTTATGTCT 
AATTTATGAAATGIAAATTAATTTAAATACATGAATTTAATATTTTTAAA 
ATTTTCAGTTTGCTTCAAATTGAGTTTCTTAATTATTTTTTTTAATTCAN 

25 GTATTCAAACTTTTGGTAAGTATTAAAGAATTATTTATGCACAATTGATT 
TATACAAAAAACTTTGTAACTTATACATCTTAAAATTCAAGATATAACTA 
ACATGTTTTACAATATATATATATATANATATATATATATATATATATAT 
ATATATATATATATATAGTAAAGCGCANAGGTCATAGGNANAGANTATTT 
TCTATTATTCTACGTTTTGCCACAAAAGTTTGAACACTTTGCCACTTTTT 

30 GTCCCTCCTTAACCTTTTCAATGTTTTGCGACAAAAGTTCCAAAACTTTG 
CCACTTTGATCATTCCTCAACTTTTCACCGCATTAGTTTGTGGAGTTGGC 
AGTTTTGGTCCCCCTAACTTCGATATTTTCTCCTGCTAGCCAAAAAGGGT 
TCCAGAGTTTCACANTTTTGGTCCCTGACAATAACCAAATGTGAGATGTC 
AAATTTTTGCCACATTAGTTTGTGGAGTTGTCCCTTTTGGTCCCCCCACA 

35 TTCGATATTCTACTATACGACCTTATTTTTCTCAAATAACAACACGTATA 
TTT.^TTACCAATGATAGAAATAGATATCAAATAAAGTATTTGTAACACC 
GTGTAAGAACGGTGCTACTATAGGTAAAAATAAACATTTCAAAGTACGAT 
GTCCTAATTGGAAAAAGAGTTTTAAAAAAATAACAACTAGGGGCGAGTTT 
TTTTTACAAGTTTGTATCAAATCATATCAAAATTTAAGGTGGAACGGTGA 

40 CCACATTAACCAGAAATGTAATTTATTCTTTGATTTTGATAATTTTTAAT 
ATTTTGTTGTGATCTATGTATTTAAAAGTAAACAACAAAGAACATAATCC 
AAAACCCTAAATTGCAAGTCTCGCCCAATTTCTCTATCACTAGTCGTCAC 
TTACGATGGCGTTACGTCGCTCTCTCACTTCTTACAACCCTTTGTTGCTA 
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CTCATTACAATAACGAAAAGTTGAATATCCATATATTTATTTGGATGTGG 

AATTGAACAAATCTCGTCAAATTTTTGATTTTGTTGATGGATTTGAGTAG 

AAGTTTGGGCAGAACGGGAATGATGGTCTGCAAGTGGTTATAAACTTGAT 

TCTGAGTTATTACTATATATGTAGCCTCnTACAACGACCAAGGTTTCTT 

CCAGGTACCATTrGATCTTTTTAGAACCCAGTTGTCTGAAACACCCTGAT 

TTGGATCAAATATCACCAACAACTCTTAAAAACTTGATTAATCAATTGTT 

TTCTTCATCTTGATAACAAGTGGAATGATTTTCTACTTAGATTAACTTGA 

AAAAAAAGGTCCATGTGCGTCTGGTGGATCTGGTAAATGAAGATGGAAGG 

GAGAGCrGACTTTAAAGACACAAACACGTCACCATATCTTTTATTTTATT 

TTAAATTTGCTTTTTTCCTATTTCnTrCTTTCTTGATCTCCAGATC^^ 

GTGGTGTGGATAATTTACACATAGAGATTGGGAACGACTGTGTTTTAGAG 

AGGACGTGGCTTGGGGTTGAGGATGGTTTATGGCTGGCCGAGTTTCATTT 

ATATAAACAAACAAATATATAAAACAAGGGGTAAAATGGCCATCTTATAT 

GTATTTAACCGTCCTTTTTTAmTTTTTTTATTmAAAm 

GGTATACCAGTGTCAGCCTCTTATTCCCAACCAGGCAACCAGTCAAATAG 

GGACTTAGGTTGTTTGGAAACAGTTCCGTGAGACCGTGACTTGGATGGTA 

GAT.^ATTTAGTAAACrTAACCCTTCAATTAACCTACCTTTTTCTTATTA 

ACTCAATTTCAACCTAAATTCTGATTCTTGTTTGAAAATAAGTTGCATCT 

TTATGTTTGTATTATCCTGTTGCATAGGATCCTTAGCATCTTTTAATAAT 

TTATTrGAAGGTGAAAGATCCAACTATTTTTTAGCTGTTGGCATTTTCCA 

TCATTTGCAACTGTTTCTTGAAAAAAAAATACCTAAAATCAAAATAACCA 

TTTTCAAATCCAAAATTATAAGAGAGAATTGTTAATGGACGTGGAATCGT 

AAATCATTAACACAGTTCAGTACACAAGTTGCTAATTACATTTCTTGCTG 

TGCAGATTGAAATTCTATCAGAGAAAGAGACATTACAAGAAGTCACTGAT 

ACT.\ATATTTCTAATGATGTTGTATTATTCCCATCCTGTCTCATGCACTC 

TTTTCATAACCTCCATAAACTTAAATTGGAGAGAGTTAAAGGAGTGGAGG 

TGGTGTTTGAGATAGAGAGTGAGAGTCCAACAAGTAGAGAATTGGTAACA 

ACTCACCATAACCAACAACATCCTATTATACTTCCCAACCTCCAGGAATT 

GGATCTAAGTTTTATGGACAACATGAGTCATGTGTGGAAGTGCAGCAACT 

GGAATAAATTCTTCACTCTTCCAAAACAACAATCAGAATCCCCATTCCAC 

AACCTCACAACCATACACATGTTCAGCTGCAGAAGCATTAAGTACTTGTT 

TTCGCCTCTCATGGCAGAACTTCTTTCCAACCTAAAGGATATCTGGATAA 

GTGGGTGTAATGGTATTAAAGAAGTTGTTTCAAAGAGAGATGATGAGGAT 

GAAGAAATGACTACATTTACATCTACCCACACAACCACCATCTTGTTCCC 

TCATCTTGATTCTCTCACTCTAAGACTACTGGAGAATCTGAAGTGTATTG 

GTGGAGGTGGTGCCAAGGATGAGGGGAGCAATGAAATATCTTTCAATAAT 

ACCACTGCAACTACTGCTGTTCTTGATCAATTTGAGGTATGCTTTGTACA 

TATTCAATTATTTATTTAATTTCCTTTTTTCTTTGCAATATTCTATAAAT 

AATACATTTTATACCCACTATACTAAGATAATAATTACCTAGAGGGATGG 

ATGCTATGACACAGCTGCTACACTTCAGAAACTCTAGTAAGGGCAGTTAT 

GGA.\GTTCAATAAAATGATAATGGCATCTTTTGATGGGTAATATAGGCAA 

TTT.\AGTTTTATTTCTGTTAAAGCAGTATTTAGCAAGTACTGGCCAGTAG 

GAGAGGAGAATATCACCTTTTGTGAAAATCTGGTCATTGTACCCAAGAAT 
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TTAGTTAAATGTAACATTTTAGATATCAGGGGACATCAGGTGACAGATAT 
TGIAGAATAGAACAATATATAATATTACCCAAAACTATTTTTTCTAAGGT 
TATTCTGTTAAATATGTGCTTTCTTGATTTCATTGAATTTGCATTCCTAT 
ATTTTAGGTGGTAAAGTGATTGTCTCTTCAATAAATCCCGAAATTAATTA 
5 AAAAAAAAAAAAACAAAAGTAAATTTTTGATATGGAGAGCACTGGTATCA 
TTTAGTATATAAAAAAACTAGATTTTGAATTAAGTTTCTTATATAAAAGC 
TGTGTATATAGTTTAATTAGTTTTACATCATTTTTCCATGTGGTGTTGCA 
GTTGTCTGAAGCAGGTGGTGTTTCTTGGAGTTTATGCCAATACGCTAGAG 
AGATAGAGATATCTAAGTGTAATGTATTGTCAAGTGTGATTCCATGTTAT 

10 GCAGCAGGACAAATGCAAAAGCTTCAAGTGCTGAGAGTAACGGGTTGTGA 
TGGCATGAAGGAGGTATTTGAAACTCAATTAGGGACGAGCAGCAACAAAA 
ACAGAAAGGGTGGTGGTGATGAAGGAAATGGTGGAATTCCAAGAGTAAAT 
AAC.AATGTTATTATGCTTCCCAATCTAAAGACATTGAAAATCTACATGTG 
CGGGGGTTTGGAACATATATTCACATTCTCTGCACTTGAAAGCCTGACAC 

15 AGCTCCAAGAGTTAAAGATAGTGGGTTGCTACGGAATGAAAGTGATTGTG 
AAG.\AGGAAGAAGATGAATATGGAGAGCAGCAAACAACAACAACAACAAC 
AACGAAGGGGGCATCTTCTTCTTCTTCTTCTTCTTCTTCTAAGAAGGTTG 
TGGTCTTTCCCCGTCTAAAGTCCATTGAACTATTCAATCTACCAGAGCTG 
GTAGGATTCTTCTTGGGGATGAATGAGTTCCGGTTGCCTTCATTGGAAGA 

20 AGTTACCATCAAGTATTGCTCAAAAATGATGGTGTTTGCAGCTGGTGGGT 
CCACAGCTCCCCAACTCAAGTATATACACACAAGATTAGGCAAACATACT 
CTTGATCAAGAATCTGGCCTTAACnTrCATCAGGTATATATATATTCCTT 
TAATTGGCATGATCTAATTAAGAAAGATATCATTCCTGCCAAGTAAATTT 
ACTTCAAACACATTCACACTGGTTTCAGTCTAAGTTTATGTTGTTCTAGG 

25 AAGGCCAAAATGGGAAAGCAAGATAGGGAAAAATAGTGTATTTCAGTGGA 
AAGGGTATTTTAGGTATTTTCTGTCAAAAGTTGTTATTGCAGGCTTTTTA 
GTACCTGGAATCGTGTGTGGGAGGAGCGTTATTATTCTGATTTGCTTGTT 
TCTTTATCATTTTTTCTTAGCCTCTCGAACAGCTAGAAACCCTTTTAATC 
TTTTGATTTTAAATGACAAAATTTTTCCCTGTTACTCTATTTGATTGTTG 

30 TTCTTCATGGTTCTAAGTGAGTTATTGGCTCATCTGTTACTTCTTTTGAT 
TGTrATTTTCATATCATGTTGTCCTTTGAATCAAGCITITCCATTTTCAA 
CCAGGGCAAAAGGTCAAAAGTAACCTACTTTATGAGATCAAAAACAGCAA 
CCCATCGGATAACTTTTAGTTGGAGTTAATAGTTACAATTACCATTGTGA 
TTA.\TAATTATAATATCTTGTATTAATTCATTAAAATTGGTACAGCACAT 

35 ATATGACATTTTAAAGGTTTGTTTTTGTTWGACATATATATGCCTCTGGC 
GTTTTCTTTATTGGACATGCAGACCTCATTCCAAAGTTTATACGGTGACA 
CCTCGGGCCCTGCTACTTCAGAAGGGACAACTTGGTCTTTTCATAACTTG 
ATCGAATTAGATATGGAATTAAATTATGATGTTAAAAAGATTATTCCATC 
CAGTGAGTTGCTGCAACTGCAAAAGCTGGAAAAGATTCATGTGAGTAGTT 

40 GTTATTGGGTAGAGGAGGTATTTGAAACTGCATTGGAAGCAGCAGGGAGA 
AATGGAAATAGTGGAATTGGTTTTGATGAATCGTCACAAACTACTACTAC 
TACTACTCTTTTCAATCTTCGAAACCTCAGAGAAATGAAGTTGCATTTTC 
TACGTGGTCTGAGGTATATATGGAAGAGCAATCAGTGGACAGCATTTGAG 
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TTTCCAAACCTAACAAGAGTTCATATAAGTAGGTGTAGAAGGTTAGAACA 
TGTATTTACTAGTTCCATGGTTGGTAGTCTATTGCAACTCCAAGAGCTAG 
ATATTAGTTGGTGCAACCATATGGAGGAGGTGATTGTTAAGGATGCAGAT 
GTTTCTGTTGAAGAAGACAAAGAGAGAGAATCTGATGGCAAGACGAATAA 

5 GGAGATACTTGTGTTACCTCGTCTAAAATCCTTGAAATTAAAATGCCTTC 
CATGTCTTAAGGGGTTTAGCTTGGGGAAGGAGGATTTTTCATTCCCATTA 
TTGGATACTTTAGAAATCTACAAATGCCCAGCAATAACGACCTTCACCAA 
GGGAAATTCTGCTACTCCACAGCTAAAAGAAATAGAAACAAGATTTGGCT 
CGTTTTATGCAGGGGAAGACATCAACTCCTCTATTATAAAAAGATCAAAC 

10 AACAGGTAAATCAGATCTTTGTTGCTTTAATAATTCTTAAACTACATTTG 
AAAAGCTTCATGCAAGTTTTTTTTGTTATATTGTCAAAAACCGCAACCTA 
CATTTTCAGCTTTATATTTATGTACTTTATGCAGGAGTTCAAACAAAACT 
CTGATTAATGTGAAGTGAATATTAAAGGTAAATTATATTTTCATGTTCCT 
AGTTGCCTATTAATTAATGGCCrmAGTTCRTGATTTTTGGATGTAGTY 

15 WTCATGATGATGTGAATCTTCTAATACCCCATTCATTGTTTGGTTGAATG 
TTGACTCTATGTCAGGATGAATATTCAAGGGAAGAATTGTTCATCATATG 
AAGGACATTAAAGAACATGGATGCTATGAAGATGTTGGAARAC 

RG2S deduced polypeptide sequence (SEQ ID NO: 125) 

20 MSDPTGIAGAimPIAQRALWVTDHVGmiSCRKYVRVMQTKMTELNTSRISVEEH 
ISRNTRNHLQIPSQKDWIJJQVEGIRANVENFPIDVITCCSUURHKLGQKAFKITEQI 
ESLTRQLSLISWTDDPVPLGRVGSMNASTSASSSDDFPSREKTFTQALKALEPNQQF 
HM\'ALCGMGGVGKTRMMQRIJ[KAAEEKKLFNYIVRAVIGEKTDPFAIQEAIADYL 
GIQL^mKTKPARADKUlEWFKKNSDGGKTKFLIVLDDVWQLVDLEDIGLSPFPNQG 

25 VDFKVLLTSRDSQVCTMMGVEANSnNVGLLTEAEAQSLFQQFVETSEPELQKIGED 
rVRKCCGLPIAIKTMACTLRNKRKDAWKDALSRIEHYDIHNVAPKVFETSYHNLQE 
EETKSTFLMCGLFPEDFDIPTEELMRYGWGLKLFDRVYTIREARTRLNTCIERLVQT 
NLUESDDVGCVKMHDLVRAFVLGMFSEVEHASIVNHGNMPEWTENDITDSCKRIS 
LTCKSMSKFPGDFKFPhHLMIUaj^GDKSLRFPQDFYEGMEKLHVISYDKMKYPLL 

30 PI^RCSTNmVLHLTKCSLKMFDCSCIGNLSNLEVLSFANSRIEWLPSTVRNLKKLR 
LLDLRFCDGLRIEQGVLKSLVKLEEFYIGNASGFTODNCNEMAERSDNLSALEFAFF 
NNKAEVKNMSFENLERFKISVGRSFDGNINMSSHSYENMLQLVTNKGDVLDSKLN 
GmJCTKVliT^VHGMNDI^DVEVKSTHPTQSSSFCNUCVUISKCVELRYLFKm 
ANTl^RLEHLEVCECENMEEUHTGICGEETITFPiaJaFI^l^QLPKLSSLCHNVNnG 

35 LPHLVDLILKGIPGFTVIYPQNKLRTSSLLKEEVVIPKLETLQIDDMENLEEIWPCELS 
GGEKVKLREKVSSCDKLVNLFPRNPMSLLHHLEELKVKNCGSIESLFNIDLDCVGA 
IGEEDNKSIiRSINMENLGKLREVWRIKGADNSHUNGFQAVESIKIEKCKRFSNIFT 
PITANFn.VALL£IQIEGCGGNHESEEQIEILSEKETLQEVTDTNISNDVVLFPSCLMH 
SFHNLHKLKLERVKGVEVVFEIESESPTSRELVTTHHNQQHPIILPNLQELDLSFMD 

40 NMSHVWKCSNWNKFFTLPKQQSESPFHNLTTIHMFSCRSIKYLFSPLMAELLSNLK 
DI\\1SGCNGIKEVVSKRDDEDEEMTTFTSTHTTTILFPHLDSLTLRLLENLKCIGGGG 
AKDEGSNEISFNNTTATTAVLDQFELSEAGGVSWSLCQYAREIEISKCNVLSSVIPCY 
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AAGQMQKLQVmWGCDGMKEVFETQLGTSSNKNRKGGGDEGNGGIPRVNNNVI- 
MLPNLKTLKIYMCGGLEHIFTFSALESLTQLQELKIVGCYGMKVIVKKEEDEYGEQ 
QTTTTTTTKGASSSSSSSSSKKVVVFPRLKSIELFNLPELVGFFLGMNEFRLPSLEEVT 
KYCSKMMVFAAGGSTAPQLKYIHTRLGKHTLDQESGLNFHQTSFQSLYGDTSGPA 
5 TSEGTWSFHNUEIJ)MEDm)VKKIIPSSELLQI^KLEKIHVSSC'^^ 

EAAGRNGNSGIGFDESSQTTTTTTLFNLRNLREMKlilFI^GIJlYIWKSNQWTAre^ 
PNLTRVHISRCRRLEHVFTSSMVGSLLQLQELDISWCNHMEEVIVKDADVSVEEDK 
ERESDGKT^«CEILVU>RIJ^SlJaJ^CLPCLKGFSLGKEDFSFPIJJ)TLElYKCPAITTFT 
KGNSATPQUCEffiTRFGSFYAGEDmSSIIKRSNNRSSNKTU^^V^C.ttJC 

10 

R62T polynucleotide sequence (SEQ ID NO: 126) 

GGA_\GACGACAATGGTGCAACGGTTGAAGAAGGTTGTGAAAGATAAGAAG 

ATGTTCCATTATATTGTCGAGGTGGTTGTAGGGGCAAACACTGACCCCAT 

TGCTATCCAGGATACTGTTGCAGATTACCTCAGCATAGAACTGAAAGGAA 

15 ATACGAGAGATGCAAGGGCTTATAAGCTTCGTGAATGCTTTAAGGCCCTC 
TCTGGTGGAGGTAAGATGAAGTTCCTAGTAATTCTTGACGATGTATGGAG 
CCCTGTTGATCTGGATGATATGGGTTTAAGTTCTTTGCCAAATCAAGGTG 
TTGACTTCAAGGTCTTGCTGACATCACGCAACAGTGATATCTGCATGATG 
ATGGGAGCTAGTTTAATTTTCAACCTCAATATGTTAACAGACGAGGAAGC 

20 ACATAATTTTTTCCGTCGATACGCAGAAATTTCTTATGATGCTGATCCCG 
AGCTTATTAAGATAGGAGAAGCTATTGTAGAGAAATGTGGTGGTTTACCC 
ATTGCCATCAAAACTATGGCCGTTACTCTTAGAAATAAACGCAAAGATGC 
ATGGAAAGATGCACTTTCTCGTTTAGAGCACCGTGACACTCATAATGTTG 
TGGCTGATGTTCTTAAATTGAGCTACAGCAATATCCAAGACGAGGAGACT 

25 CGGTCGATTTTTTTGCTATGTGGTTTGTTTCCTGAAGACTTTGATATTCC 

TACCGAAGACTTAGTGAGGTATGGATGGGGATTGAAAATATTTACCAGAG 
TGTATACTATGAGACATGCAAGAAAAAGGTTGGACACGTGCATTGAGCGG 
CTTATGCATGCCAACATGTTGATAAAAAGTGATAATGTTGGATTTGTCAA 
GATGCATGATCTGGTTCGTGCTTTTGTTTTGGGCATGTTATCTGAAGTCG 

30 AGCATGCATCAATTGTCAACCATGGGGATATGCCAGGGTGGTTTGAAACT 
GCA.AATGATAAGAACAGCTTGTGCAAAAGAATTTCATTAACATGCAAAGG 
TATGTCTGCGATTCCTGAAGACCTCACGTTTCCAAACCTCTCGATCCTGA 
AATTAATGGATGGAGACGAGTCACTGAGGTTTCCTGAAGGCTTTTATGGA 
GAAATGGAAAACCTTCAGGTTATATCATATGATAACATGAAGCAGCCATT 

35 TCTTCCACAATCACTTCAATGCTCCAATGTTCGAGTGCTTCATCTCCATC 
ACTGCTCATTAATGTTTGATTGCTCTTCTATTGGAAATCTTTTGAATCTC 
GAGGTGCTCAGCATTGCTAATTCTGCCATTAAATTGTTACCCTCCACTAT 
TGGAGATCTGAAGAAGCTAAGGCTCCTGGATTTGACAAATTGTGTTGGTC 
TCTGTATAGCTAATGGCGTCTTTAGAAATTTGGTCAAACTTGAAGAGCTT 

40 TATATGAGAGTTGATGATCGAGATTCGTTTTTTGTGAAAGCTGATGACAG 
CAAGACCATTACCT 
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RG2T deduced polypeptide sequence (SEQ ID NO:127) 

KTTMVQRlJCKVVKDKmFHYIVEVWGANTDPIAIQDTVADYL^^ 

AYK1JIECFKAI^GGGKMKFLVIU)DWSPVDLDDIGI^SIJ»NQGVDFKVLLTS^ 

DICMMMGASLIFNLNMLTDEEAHNFFRRYAEISYDADPELIKIGEAIVEKCGGLPIAI 

KTMAVTUU^nOlKDAWKDAI^RLEHRDTHNWADVUa^YSmQDEETRSIF^ 

LFPEDFDIF^EDLWYGWGIJ□FTRVYTMRHARKRIJ)TCffiRLMHANMIJKSD^^^ 

FVKMHDLVRAFVlX}MLSEVEHASIVNHGDMPGWFETANDKNSLCmSL^ 

AIPEDLTFPNI^XIiOJvroGDESUlFPEGFi^GEMENLQVISYDNMKQPFLPQSLQCSN 

WVLHLHHCSLMFDCSSIGNLU^LEVLSIANSAIKLIJ'STIGDlJXlJaJJDLTNCV 

CIANGVFRNLVKLEELYMRVDDRDSFFVKADDSKTIT 



RG2U polynucleotide sequence (SEQ ID NO:128) 

GCCTTGTGTGGGATGGGTGGAGTGGGAAAGACCACTGTGATGAAGAAGCT 

GAAGGAGGTTGTGGTAGGAAAGAAACTGTTTAATCATTATGTTGAGGCGG 

TTATAGGGGAAAAGACAGACCCCATTGCTATTCAACAAGCTGTTGCCGAG 

TACCTTGGTATAAGTCTAACCGAAACCACTAAACCAGCAAGAACTGATAA 

GCTCCGTACATGGTTTGCAAACAACTCAAATGGAGGAAAGAAGAAGTTCC 

TGGTAATACTAGACGATGTATGGCAACCAGTTGATTTGGAAGATATTGGT 

TTAAGTCGTTTTCCAAATCAAGATGTTGACTTCAAGGTCTTGATTACATC 

ACGGGACCAATCAGTTTGCACTGAGATGGGAGTTAAAGCTGATTTAGTTC 

TCAAGGTGAGTGTCCTGGAGGAAGCGGAAGCACACAGTTTGTTCCTCCAA 

TTTTTAGAACCTTCTGATGATGTCGATCCTGAGCTCAATAAAATCGGAGA 

AGAAATTGTAAAGAAGTGTTGCAGACTACCCATTGCTATCAAAACCATGG 

CCTGAACTCTTAGAAGTAAAAGTAAGGATACATGGAAGAATGCCCTTTCT 

CGTTTACAACACCATGACATTAACACAATTGCGTCTACTGTTTTCCAAAC 

TAGCTATGACAATCTCGAAGACGAGGTGACTAAAGCTACTTTTTTGCTTT 

GTGGTTTATTTCCGGAGGACTTCAATATTCCTACCGAGGACCTATTGAGG 

TATGGATGGGGATTGAAGTTATTCAAGGAAGTAGATACTATACGAGAAGC 

AAGATCCAAGTTGAAAGCCTGCATTGAGCGGCTCATGCATACCAATTTGT 

TGATCGAAGGTGATGATGTTAGGTACGTTAAGATGCATGATCTGGTGCGT 

GCTTTTGTTTTGGATATGTTTTCTAAAGCCGAGCATGCATCTATTGTCAA 

CCATGGTAGTAGTAAGCCAAGGTGGCCTGAAACTGAAAGTGATGTGAGCT 

CCTCTTGCAAAAGAATTTCATTAACATGCAAGGGTNTG 

RG2U deduced polypeptide sequence (SEQ ID NO:129) 

ALCGMGGVGKTTVMKKLKEVVVGKKLFNHYVEAVIGEKTDPIAIQQAVAEYLGIS 

LTETTKPARTDKUITWFANNSNGGKKKFLVILDDVWQPVDLEDIGLSRFPNQDVD 

FK\TJTSRDQSVCTEMGVKADLVLKVSVLEEAEAHSLFLQFLEPSDDVDPELNKIGE 

EIVKKCCRLPIAIKTMA.TLRSKSKDTWKNALSRLQHHDINTIASTVFQTSYDNLEDE 

VTKATFLLCGLFPEDFNIPTEDLLRYGWGLKLFKEVDTIREARSKLKACIERLMHTN 
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LUEGDDWYVKNIHDLVRAFVLDMFSKAEHASIVNHGSSKPRWPETESDVSSSCKR- 
ISLTCKG? 



RG2V polynucleotide sequence (SEQ ID NO:130) 

5 CTGTGGAAGACACGAATGATSAAGAAGCTGAAGGAGGTCGTGGAACAAAA 
GAAAATGTTCAATATTATTGTTCAAGTGGTCATAGGAGAGAAGACAAACC 
CTATTGCTATTCAGCAAGCTGTAGCAGATTACCTCTCTATTGAGCTGAAA 
GAAAACACTAAAGAAGCAAGAGCTGATAAGCTTCGTNAATGGTTCGAGGA 
CGATGGAGGAAAGAATAAGTTCCTTGTAATACTTGATGATGTATGGCAGT 

10 TTGTCGATCTTGAAGATATTGGTTTAAGTCCTCTGCCAAATAAAGGTGTC 
AACTTCAAGGTCTTGTTGACGTTAAGAGATTCACATGTTTGCACTCTGAT 
GGGAGCTGAAGCCAATTCAATTCTCAATATAAAAGTTTTAAAAGATGTTN 
AAGGACAAAGTTTGTTCCGCCAGTTTGCTAAAAATGCAGGTGATGATGAC 
CTGGATCCTGCTTTCAATGGGATAGCAGATAGTATTGCAAGTAGATGTCA 

15 AGGTTTGCCCATTGCCATCAAAACCATTGCCTTAAGTCTTAAAGGTAGAA 
GCAAGCCTGCGTGGGACCATGCGCTTTCTCGTTTGGAGAACCATAAGATT 
GGTAGTGAAGAAGTTGTGCGTGAAGTTTTTAAAATTAGCTATGACAATCT 
CCAAGATGAGGTTACTAAATCTATTTTTWTACmGTGCTTTATTTCCTG 
AAGATTTTGATATTCCTATTGAGGAGTTGGTGAGGTATGGGTGGGGCTTG 

20 AAATTATTTATAGAAGCAAAAACTATAAGAGAAGCAAGAAACAGGCTCAA 
CACCTGCACTGAGCGGCTTAGGGAGACAAATTTGTTATTTGGAAGTGATG 
ACATTGGATGCGTCAAGATGCACGATGTGGTGCGTGATnTGTTTGGTAT 
ATATTCTCAGAAGTCCAGCACGCTTCAATTGTCAACCATGGTAATGTGTC 
AGAGTGGCTAGAGGAAAATCATAGCATCTACTCTTGTAAAAGAATTTCAT 

25 TAACATGCAAGGGTATGTCTGAGTTTCCCAAAGACCTCAAATTTCCAAAC 
CTTTCAATTTTGAAACTTATGCATGGAGATAAGTCGNTGAGCTTTCCTGA 
AGACTTTTATGGAAAGATGGAAAAGGTTCAGGTAATATCATATGATAAAT 
TGATGTATCCATTGCTTCCCTCATCACTTGAATGCTCCACTAACGTTCGA 
GTGCTTCATCTCCATTATTGTTCATTAAGGATGTTTGATTGCTCTTCAAT 

30 TGGTAATCTTCTCAACATGGAAGTGCTCAGCTTTGCTAATTCTAACATTG 
AATGGTTACCATCTACAATTGGAAATTTGAAGAAGCTAAGGCTACTAGAT 
TTGACAAATTGTAAAGGTCTTCGTATAGATAATGGTGTCTTAAAAAATTT 
GGTCAAACTTGAAGAGCTTTATATGGGTGTTAATGTCCGTATGGACCAGG 
CCGT 

35 

RG2V deduced polypeptide sequence (SEQ ID NO:131) 

LWKTRM?KKIJCEWEQKmFNIIVQVVIGEKTNPIAIQQAVADYLSffiLKENTKEAR 
ADKLR?WFEDDGGKNKFLmDDWQFVDl£DIGI^PIJ>NKGVNFKVLLTLRDSH 

VCTLMGAEANSILNIKVLKDV7GQSLFRQFAKNAGDDDLDPAFNGIADSIASRCQGL 
40 PIAIKTIALSLKGRSKPAWDHALSRLENHKIGSEEVVREVFKISYDNLQDEVTKSIF7L 
CALFPEDFDIPIEELVRYGWGLKLHEAKTIREARNRLNTCTERLRETNLLFGSDDIG 
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CVKMHDVVW)FVWYIFSEVQHASIVNHGNVSEWI^ENHSIYSCmSLTCKGM^^^ 
PKDUCFPNI^IIJaMHGDKS?SFPEDFi'GKMEKVQVISYDKLMYPLLPSSLECSTNV 
RVU^mYCSIJl^lFDCSSIGNLLNMEVI^FANS^^EWlJ>STIGNljaa^U)LTO^ 
LRIDNGVLKNLVKLEELYMGVNVRMDQAV 



R62W polynucleotide sequence (SEQ ID NO:132) 

TTGGGAAAGAGACAATGATGAAGAATTGAAAGAGGTTGTGGTTGAAAAGA 

AAATGTTTAATCATTATGTGGAGGCGGTTATAGGGGAGAAGACGGACCCC 

ATTGCTATTCAGCAAGCCGTTGCAGAGTACCTTGGTATAATTCTAACAGA 

AACCACTAAGGCAGCAAGAACCGATAAGCTACGTGCATGGCTTTCTGACA 

ATTCAGATGGAGGAAGAAAGAAGTTCCTAGTAATACTAGACGATGTATGG 

CATCCGGTTGATATGGAAGATATTGGTTTAAGTCGTTTCCCAAATCAAGG 

TGTCGACTTCAAGGTCTTGATTACATCACGGGACCAAGCTGTTTGCACTG 

AGATGGGAGTTAAAGCTGATTCAGTTATCAAGGTGAGTGTCCTAGAGGAA 

GCTGAAGCACAAAGCTTATTCTGCCAACTTTGGGAACCTTCTGATGATGT 

CGATCCTGAGCTCCATCAGATTGGAGAAGAAATTGTAAGGAAGTGTTGTG 

GTTTACCCATTGCAATAAAAACCATGGCCTGCACTCTTAGAAGTAAAAGC 

AAGGATACATGGAAGAATGCACTTTCTCGTTTACAACACCATGACATTAA 

CACAGTCGCGCCTACTGTTTTTCAAACCAGCTATGACAATCTCCAAGATG 

AGGTGACTGGAGATACTTTTTTGCTATGTGGTTTGTTTCCGGAGGACTTC 

GATATTCCTACTGAAGACTTATTGAAGTATGGATGGGGCTTAAAATTATT 

CAAGGGAGTGGATTCTGTAAGAGAAGCAAGATACCAGTTGAACGCCTGCA 

TTGAGCGGCTCGTGCATACCAATTTGTTGATTGAAAGTGATGTTGTTGGG 

TGCGTCAAGTTGCACGATCTGGTGCGTGCTTTTATTTTGGATATGTTTTG 

TAAAGCGGAGCATGCTTCGATTGTCAACCATGGTAGTAGTAAGCCTGGGT 

GGCCTGAAACTGAAAATGATGTGATCAGGACCTCCTGCAAAAGAATCTCA 

TTA.\CATGCAAGGGTATGATTGAGTTTTCTAGTGACCTCAAGTTTCCAAA 

TGTCTTGATTTTAAAACTTATGCATGGAGATAAGTCGCTAAGGTTT 

RG2W deduced polypeptide sequence (SEQ ID NO:133) 

WERDNDEEUCEVVVEKKMF^mYVEAVIGEKTDPIAICK^AVAEYIX}I^.TETTKAAR 

TDKIJIAWLSDNSDGGRKKFLVILDDVWHPVDMEDIGLSRFPNQGVDFKVLITSRD 

QAVCTEMGVKADSVIKVSVLEEAEAQSLFCQLWEPSDDVDPELHQIGEEIVRKCCG 

LPIAIKTMACTLRSKSKDTWKNALSRLQHHDINTVAPTVFQTSYDNLQDEVTGDTF 

LXCGLFPEDFDn^EDLLKYGWGIiOJKGVDSVREARYQLNACIERLVHTNLLIESD 

WGCVKLHDLVRAFILDMFCKAEHASIVNHGSSKPGWPETENDVIRTSCKRISLTCK 

GMIEFSSDLKFPNVLILKLMHGDKSLRF 



RG5 polynucleotide sequence (SEQ ID NO:I34) 

GGGGGGGTGGGGAAGNCGACTCTAGCCCAGAAGNTCTATAATGACCATAA 
AATAAAAGGAAGCTTTAGTAAACAAGCATGGATCTGTGTTTCTCAACAAT 
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ATTCTGATATTTCAGTTTTGAAAGAAGTCCTTCGGAACATCGGTGTTGAT 
TATAAGCATGATGAAACTGTTGGAGAACTTAGCAGAAGGCTTGCAATAGC 
TGTCGAAAATGCAAGTTTCTTTCTTGTGTTGGATGATATTTGGCAACATG 
AGGTGTGGACTAATTTACTCAGAGCCCCATTAAACACTGCAGCTACAGGA 
5 ATAATTCTAGTAACAACTCGTAATGATACAGTTGCACGAGCAATTGGGGT 
GGAAGATATTCATCGAGTAGAATTGATGTCAGATGAAGTAGGATGGAAAT 
TGCTTTTGAAGAGTATGAACATTAGCAAAGAAAGTGAAGTAGAAAACCTA 
CGAGTTTTAGGGGTTGACATTGTTCGTTTGTGTGGTGGCCTCCCCCTAGC 
CTT 

10 

RG5 deduced polypeptide sequence (SEQ ID NO:135) 

GGVGKTTLAQK7YNDHKIKGSFSKQAWICVSQQYSDISVLKEVLRNIGVDYKHDET 
VGELSiaOJUAVENASFFLVLDDIWQHEVWTNLLRAPLNTAATGIILVTTRNDTVA 
RAIGVEDmVELMSDEVGWKLLLKSMNISKESEVENLRVLGVDIVRLCGGLPLAL 

15 

RG7 polynucleotide sequence (SEQ ID NO:136) 

GGTGGGGTTGGGAAGACAACGGGCACAAGGAGGCGACTGCCAATACTTCC 
GACTTTTATTCATAGAGATGACGAGTCTTATTTTCCTACTACTATAGGGA 

GGATATTTGGTTGCGCGAGACGATTCATTGCGCGAAGGGATTCTATCCTT 
20 CTTTTTTTCCGCGAAGACTTCGTTCCGGAGGACGGGCTATATTCCCTTTA 
ATATTAGTCTAGCCCAGTCTAGGCCAACCATATGGCGATGCGGTAGACCT 
CCCAGAGATAGATACTTGATCTTAGAGGATTCACACGTTCAATGGTGGAA 
ACTTAAGGAACCGGCTAAGAGTGACTAAACGGAAAAACCCTATTCATTCC 
ATAGCCTCATCCGGTCGAGGCATTAAACAATCCATCCCAATCCTCTTTCC 
25 TTTGGTCTACTCTAATGATGTGCCCGTTCGTTGGTGGAATATCTCTTTAT 
ACCGACGATTTATATGGGGATTGCCACTAGCGTTG 

The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary sicill 
30 in the art and are encompassed by the appended claims. All publications, patents, and 
patent applications cited herein are hereby incorporated by reference. 
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WT^ATT^; TT AIMED IS: 

1 . An isolated nucleic acid constract comprising an RG polynucleotide which encodes 
an RG polypeptide having at least 60% sequence identity to an RG polypeptide from an 
RG family selected from the group consisting of: an RGl polypeptide, an RG2 
polypeptide, an RG3 polypeptide, an RG4 polypeptide, an RG5 polypeptide, and an RG7 
polypeptide. 

2. The nucleic acid construct of claim 1, wherein the RG polynucleotide encodes an 
RG polypeptide comprising an leucine rich region (LRR). 

3. The nucleic acid construct of claim 1, wherein the RG polynucleotide encodes an 
RG polypeptide comprising a nucleotide binding site (NBS). 

4. The nucleic acid construct of claim 1, wherein the polynucleotide is a fiill length 
gene. 

5 . The nucleic acid construct of claim 1 , wherein the further encodes a fusion protem. 

6. The nucleic acid construct of claim 1, wherein the RGl polypeptide is encoded by 
an RGl polynucleotide sequence. 

7. The nucleic acid construct of claim 6, wherein the RGl polypeptide is encoded by a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:l (RGl A), 
SEQ ID N0:2 (RGIB), SEQ ID NO: 3 (RGIC), SEQ ID N0:4 (RGID), SEQ ID N0:5 
(RGIE), SEQ ID N0:6 (RGIF), SEQ ID N0:7 (RGIG), SEQ ID NO:8 (RGIH), SEQ ID 
N0:9 (RGII). and SEQ ID NO:10 (RGIJ). 

8. The nucleic acid construct of claim 1 , wherein the RG2 polypeptide is encoded by 
an RG2 polynucleotide sequence. 



9. The nucleic acid construct of claim 8, wherein the RG2 polypeptide is encoded by a 
polynucleotide sequence selected from the group consisting of: SEQ ID N0:21 (RG2A); 
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SEQ ID NO:23 (RG2B); SEQ ID NO:25 (RG2C); SEQ ID NO:27 (RG2D); SEQ ID 
NO:29 (RG2E); SEQ ID N0:31 (RG2F); SEQ ID NO:33 (RG2G); SEQ ID NO:35 
(RG2H); SEQ ID NO:37 (RG2I); SEQ ID NO:39 (RG2J); SEQ ID N0:41 (RG2K); SEQ 
ID NO:43 (RG2L); SEQ ID NO:45 (RG2M); SEQ ID NO:87 (RG2A); SEQ ID NO:89 

5 (RG2B); SEQ ID N0:91 (RG2C); SEQ ID NO:93 (RG2D) and SEQ ID NO:94 (RG2D); 
SEQ ID NO:96 ( RG2E); SEQ ID NO:98 (RG2F); SEQ ID NO:100 (RG2G); SEQ ID 
NO:102 (RG2H); SEQ ID NO:104 (RG2I); SEQ ID NO:106 (RG2J) and SEQ ID NO:107 
(RG2J); SEQ ID NO:109 (RG2K) and (SEQ ID NOillO (RG2K); SEQ ID N0:1 12 (RG2L); 
SEQ ID NO: 114 (RG2M); SEQ ID NO: 1 1 6 (RG2N); SEQ ID NO: 1 1 8 (RG20); SEQ ID 

10 NO: 120 (RG2P); SEQ ID NO:122 (RG2Q); SEQ ID NO:124 (RG2S); SEQ ID N0:126 
(RG2T); SEQ ID NO:128 (RG2U); SEQ ID NO:130 (RG2V); and, SEQ ID N0:132 
(RG2W). 

10. The nucleic acid construct of claim 1 , wherein the RG3 polypeptide is encoded by 
IS an RG3 polynucleotide sequence. 

1 1 . The nucleic acid construct of claim 10, wherein the RG3 polypeptide is encoded by 
a pohTiucleotide sequence as set forth in SEQ ID NO:68. 

20 12. The nucleic acid construct of claun 1, wherein the RG4 polypeptide is encoded by 
an RG4 polynucleotide sequence. 

13. The nucleic acid construct of claun 12, wherein the RG4 polypeptide is encoded by 
a polynucleotide sequence as set forth in SEQ ID NO:69. 

25 

14. The nucleic acid construct of claim 1, wherein the RG5 polypeptide is encoded by 
an RG5 polynucleotide sequence. 

15. The nucleic acid construct of claim 14, wherein the RG5 polypeptide is encoded by 
30 a pohTiucleotide sequence as set forth in SEQ ID NO: 134, 
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16. The nucleic acid construct of claim 1, wherein the RG7 polypeptide is encoded by • 
an RG7 polynucleotide sequence. 

17. The nucleic acid construct of claim 16, wherein the RG7 polypeptide is encoded by 
a polynucleotide sequence as set forth in SEQ ID NO: 136. 

18. The nucleic acid construct of claim 1, ftirther comprising a promoter operably 
linked to the KG polynucleotide. 

19. The nucleic acid construct of claim 18, wherein the promoter is a plant promoter. 

20. The nucleic acid construct of of claim 19, wherein the plant promoter is a disease 
resistance promoter. 

21. The nucleic acid construct of claim 19, wherein the plant promoter is a lettuce 
promoter. 

22. The nucleic acid construct of claim 18, wherein the promoter is a constitutive 
promoter. 

23. The nucleic acid construct of claim 18. wherein the promoter is an inducible 
promoter. 

24. The nucleic acid construct of claim 18, wherein the promoter is a tissue-specific 
promoter. 

25. A nucleic acid construct comprising a promoter sequence from an RG gene linked 
to a heterologous polynucleotide. 

26. A transgenic plant comprising a recombinant expression cassette comprising a 
promoter operably linked to an RG polynucleotide. 
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27. The transgenic plant of claim 26, wherein the plant promoter is a plant promoter. • 

28. The transgenic plant of claim 26, wherein the plant promoter is a viral promoter. 

5 29. The transgenic plant of claim 26, wherein the plant promoter is a heterologous 
promoter. 

30. The transgenic plant of claim 26, wherein the plant is letmce. 

10 31 . The transgenic plant of claim 26, wherein the RG polynucleotide is selected from 
the group consisting of SEQ ID N0:1 (RGIA), SEQ ID N0:2 (RGIB), SEQ ID NO: 3 
(RGIC). SEQ ID N0:4 (RGID), SEQ ID NO:5 (RGIE), SEQ ID N0:6 (RGIF), SEQ ID 
N0:7 (RGIG), SEQ ID N0:8 (RGIH), SEQ ID N0:9 (RGII), and SEQ ID NO:10 
(RGIJ). 

15 

32. The transgenic plant of claim 26, wherein the RG polynucleotide is selected from 
the group consisting of SEQ ID N0:21 (RG2A); SEQ ID NO:23 (RG2B); SEQ ID NO:25 
(RG2C); SEQ ID NO:27 (RG2D); SEQ ID NO:29 (RG2E); SEQ ID N0:31 (RG2F); SEQ 
ID NO:33 (RG2G); SEQ ID NO:35 (RG2H); SEQ ID NO:37 (RG2I); SEQ ID NO:39 

20 (RG2J); SEQ ID NO:41 (RG2K); SEQ ID NO:43 (RG2L); SEQ ID NO:45 (RG2M); SEQ 
ID NO:87 (RG2A); SEQ ID NO:89 (RG2B); SEQ ID N0:91 (RG2C); SEQ ID NO:93 
(RG:D) and SEQ ID NO:94 (RG2D); SEQ ID NO:96 ( RG2E); SEQ ID NO:98 (RG2F); 
SEQ ID NO:100 (RG2G); SEQ ID NO:102 (RG2H); SEQ ID NO:104 (RG21); SEQ ID 
NO: 106 (RG2J)and SEQ ID NO: 107 (RG2J); SEQ ID NO: 109 (RG2K) and (SEQ ID 

25 NO: 1 10 (RG2K); SEQ ID N0:1 12 (RG2L); SEQ ID N0:1 14 (RG2M); SEQ ID N0:1 16 
(RG2N); SEQ ID N0:1 18 (RG20); SEQ ID NO:120 (RG2P); SEQ ID NO:122 (RG2Q); 
SEQ ID NO:124 (RG2S); SEQ ID NO:126 (RG2T); SEQ ID NO:128 (RG2U); SEQ ID 
NO: 1 30 (RG2V); and, SEQ ID NO: 132 (RG2W). 

30 33. The transgenic plant of claim 26, wherein the RG polynucleotide is selected from 
the group consistmg of SEQ ID NO:68 (RG3) and SEQ ID NO:69 (RG4). 
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34. The transgenic plant of claim 26, wherein the RG polynucleotide comprises a 
sequence as set forth in SEQ ID NO: 134 (RG5). 

35 . The transgenic plant of claim 26, wherein the RG polynucleotide con^rises a 
sequence as set forth in SEQ ID NO: 136 ^G7), 

36. The transgenic plant of claim 26, wherein the RG polynucleotide encodes an RGl 
polypeptide selected from the group consisting of SEQ ID NO: 11 (RGl A), SEQ ID NO: 12 
(RGIB), SEQ ID NO: 13 (RGIC), SEQ ID N0:14 (RGID), SEQ ID N0:15 (RGIE), SEQ 
ID N0:16 (RGIF). SEQ ID N0:17 (RGIG), SEQ ID N0:18 (RGIH). SEQ ID N0:19 
(RGII), and SEQ ID NO:20 (RGIJ). 

37. The transgenic plant of claim 26, wherein the RG polynucleotide encodes an RG2 
polypeptide selected from the groi^ consisting of SEQ ID NO:22 and SEQ ID N0:41 
(RG2A); SEQ ID NO:24 and SEQ ID NO:42 (RG2B); SEQ ID NO:43 (RG2C): SEQ ID 
NO:44 (RG2D); SEQ ID NO:45 (RG2E); SEQ ID NO:46 (RG2F); SEQ ID NO:47 
(RG2G); SEQ ID NO:48 (RG2H); SEQ ID NO:49 (RG2I); SEQ ID NO:50 (RG2J); SEQ 
ID NO:51 (RG2K); SEQ ID NO:52 (RG2L); SEQ ID NO:53 (RG2M); SEQ IDNO:88 
(RG2A); SEQ ID NO:90 (RG2B); SEQ ID NO:92 (RG2C); SEQ ID NO:95 (RG2D); SEQ 
ID NO:97 (RG2E); SEQ ID NO:99 (RG2F); SEQ ID NO: 101 (RG2G); SEQ ID NO: 103 
(RG2H); SEQ ID N0:105 (RG2I); SEQ ID NO:108 (RG2J); SEQ ID N0:1 1 1 (RG2K); SEQ 
ID KG: 1 13 (RG2L); SEQ ID N0:1 15 (RG2M); SEQ ID N0:1 17 (RG2N); SEQ ID N0:1 19 
(RG20); SEQ IDN0:121 (RG2P); SEQ IDNO:123 (RG2Q); SEQ IDNO:125 (RG2S); 
SEQ ID NO: 127 (RG2T); SEQ ID NO: 129 (RG2U); SEQ IDN0:131 (RG2V); and, SEQ ID 
NO:133(RG2W). 

38. The transgenic plant of claim 26, wherein the RG polynucleotide encodes an RG3 
polypeptide with a sequence as set forth by SEQ ID NO: 138. 



39. The transgenic plant of claim 26, wherein the RG polynucleotide encodes an RG4 
polypeptide with a sequence as set forth by SEQ ID NO: 139. 



wo 98/30083 PCTAJS98/00615 

175 

40. The transgenic plant of claim 26, wherein the RG polynucleotide encodes' an RG5 ' 
polypeptide with a sequence as set forth by SEQ ID NO:135. 

41 . A method of enhancing disease resistance in a plant, the method comprising 
5 introducing into the plant a recombinant expression cassette comprising a promoter 

functional in the plant and operably linked to an RG polynucleotide sequence. 

42. The method of claim 41, wherein the plant is a lettuce plant. 

10 43 . The method of claim 4 1 , wherein the RG polynucleotide encodes an RG polypeptide 
selected from the group consisting of SEQ ID NO:22 and SEQ ID N0:41 (RG2A); SEQ 
ID NO:24 and SEQ ID NO:42 (RG2B); SEQ ID NO:43 (RG2Q; SEQ ID NO:44 (RG2D); 
SEQ ID NO:45 (RG2E); SEQ ID NO:46 (RG2F); SEQ ID NO:47 (RG2G); SEQ ID 
NO:48 (RG2H); SEQ ID NO:49 (RG2I); SEQ ID NO:50 (RG2J); SEQ ID N0:51 

15 (RG2K); SEQ ID NO:52 (RG2L); SEQ ID NO:53 (RG2M); SEQ ID NO:88 (RG2A); SEQ 
ID NO:90 (RG2B); SEQ ID NO:92 (RG2C); SEQ ID NO:95 (RG2D); SEQ ID NO:97 
(RG2E); SEQ ID NO:99 (RG2F); SEQ ID NO:101 (RG2G); SEQ ID NO:103 (RG2H); SEQ 
ID NO:105 (RG2I); SEQ ID NO:108 (RG2J); SEQ ID N0:1 11 (RG2K); SEQ ID N0:1 13 
(RG2L); SEQ ID NO: 11 5 (RG2M); SEQ ID NO: 1 1 7 (RG2N); SEQ ID NO: 1 1 9 (RG20); 

20 SEQIDN0:121 (RG2P); SEQ IDNO:123 (RG2Q); SEQ IDNO:125 (RG2S); SEQ ID 

NO: 1 27 (RG2T); SEQ ID NO:129 (RG2U); SEQ ID N0:131 (RG2V); and, SEQ ID NO:133 
(RG2W). 

44. The method of claim 4 1 , wherein the RG polynucleotide encodes an RG polypeptide 
25 selected from the group consisting of SEQ ID NO: 138 (RG3); SEQ ID NO: 139 ^G4); and 

SEQ ID NO: 135 (RG5). 

45. The method of claim 41, wherein the promoter is a tissue-specific promoter or a 
plant disease resistance promoter. 

30 
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46. The method of claim 41, wherein the promoter is a constitutive promoter or an 
inducible promoter. 

47. A method of detecting RG resistance genes in a nucleic acid sample, the method 
comprising: 

contacting the nucleic acid sample with an RG polynucleotide to form a 
hybridization complex; and, 

wherein the formation of the hybridization complex is used to detect the RG 
resistance gene in the nucleic acid sample. 

48. The method of claim 47, wherein the RG polynucleotide is an RGl polynucleotide. 

49. The method of claim 47, wherein the RG polynucleotide is an RG2 polynucleotide. 

50. The method of claim 47, wherein the RG polynucleotide is an RG3 polynucleotide, 
an RG4 polynucleotide, an RG5 polynucleotide or an RG7 polynucleotide. 

51. The method of claim 47, wherein the RG resistance gene is amplified prior to the 
step of contacting the nucleic acid sample with the RG polynucleotide. 

52. The method of claim 51 , where the RG resistance gene is amplified by the 
polymerase chain reaction. 

53 . The method of claim 47 , wherein the RG polynucleotide is labeled. 



54. An RG polypeptide having at least 60% sequence identity to a polypeptide selected 
from the group consisting of: an RGl polypeptide, an RG2 polypeptide, an RG3 
polypeptide, an RG4 polypeptide, an RG5 polypeptide, and an RG7 polypeptide. 
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Boi I Obserifatioiis where certain daimf were round unieerchable (COodnuellon of Item 1 of first sheet) 

Hui btanationai icpoit hu not been crtabliihfrf m mpeot of certain claima under Aitklo 17(2)(a) for the following icaaons: 

1. ri ClainuNos.: 

becaiiae they relate to aubjecl matxer not requiied to be learcfaed by Ihia AuthoriQr, namely: 



2. nn Claims Nos.: 7, 9, 11, 13, 15, 17, 31-40, 43-44 

bficansr. Ihqr rdatcto parta of the inleniational application that do not comply with the prescribed requirements to such 
an cadBot tliat no meaninsfol international aeareh can be carried out, speoifically: 

these cfadms are dnwn to numerous sequences identified by SBQ ID NOs. However, since no computer readable form 
was submitted, no meaniogfol search oouU be carried out 



rn Claims Nos. 



because they aie dependeni claims and are not diafted in accordance with the second and dud srntmofs of Rule 6.4(a). 
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This International Searching Authority found multiple inventions in this international applieation, u follows: 



□ As all required additional search fees were timely paid by the appUcant, this international search report covers all searchable 



claims, 



2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. As only some of the required additional search feea were timely paid by the applicant, this international search report coven 
only those elaims for which fees were paid, specifically claims Nos.: 



4. No required additional aeareh fees were timely paid by the applicanL Consequently, this international search report is 

restricted to the invention first mentioned in the elaims; it is covered by claims Nos.: 



Remark on Protest [ | The additional search fees were accompanied by the applicant's protest. 

I { No protest accompanied the payment of additional search fees. 
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